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Zur Megede, Jan 

<120> POLYNUCLEOTIDES ENCODING ANTIGENIC HIV TYPE C 
POLYPEPTIDES, POLYPEPTIDES AND USES THEREOF 

<130> PP01631.101 

<140> 
<141> 

<150> 09/475,704 
<151> 1999-12-30 

<160> 45 

<170> Patentin Ver. 2.0 

<210> 1 
<211> 60 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 1 

gacatcaagc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 60 

<210> 2 
<211> 60 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 2 

gacatccgcc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 60 

<210> 3 
<211> 1479 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic Gag 
of HIV strain AF110965 

<400> 3 

atgggcgccc gcgccagcat cctgcgcggc ggcaagctgg acgcctggga gcgcatccgc 60 
ctgcgccccg gcggcaagaa gtgctacatg atgaagcacc tggtgtgggc cagccgcgag 12 0 
ctggagaagt tcgccctgaa ccccggcctg ctggagacca gcgagggctg caagcagatc 18 0 
atccgccagc tgcaccccgc cctgcagacc ggcagcgagg agctgaagag cctgttcaac 240 
accgtggcca ccctgtactg cgtgcacgag aagatcgagg tccgcgacac caaggaggcc 3 00 
ctggacaaga tcgaggagga gcagaacaag tgccagcaga agatccagca ggccgaggcc 3 60 
gccgacaagg gcaaggtgag ccagaactac cccatcgtgc agaacctgca gggccagatg 42 0 
gtgcaccagg ccatcagccc ccgcaccctg aacgcctggg tgaaggtgat cgaggagaag 4 80 



1 



0 



gccttcagcc ccgaggtgat ccccatgttc accgccctga gcgagggcgc caccccccag 54 0 
gacctgaaca cgatgttgaa caccgtgggc ggccaccagg ccgccatgca gatgctgaag 600 
gacaccatca acgaggaggc cgccgagtgg gaccgcgtgc accccgtgca cgccggcccc 660 
atcgcccccg gccagatgcg cgagccccgc ggcagcgaca tcgccggcac caccagcacc 72 0 
ctgcaggagc agatcgcctg gatgaccagc aaccccccca tccccgtggg cgacatctac 780 
aagcggtgga tcatcctggg cctgaacaag atcgtgcgga tgtacagccc cgtgagcatc 84 0 
ctggacatca agcagggccc caaggagccc ttccgcgact acgtggaccg cttcttcaag 900 
accctgcgcg ccgagcagag cacccaggag gtgaagaact ggatgaccga caccctgctg 960 
gtgcagaacg ccaaccccga ctgcaagacc atcctgcgcg ctctcggccc cggcgccagc 1020 
ctggaggaga tgatgaccgc ctgccagggc gtgggcggcc ccagccacaa ggcccgcgtg 1080 
ctggccgagg cgatgagcca ggccaacacc agcgtgatga tgcagaagag caacttcaal 1140 
ggcccccggc gcatcgtcaa gtgcttcaac tgcggcaagg agggccacat cgcccgcaac 1200 
aactar^r'' ^^"^^^^^^^ gggctgctgg aagtgcggca aggagggcca ccagatgaag 1260 
gactgcaccg agcgccaggc caacttcctg ggcaagatct ggcccagcca caagggccqc 1320 
cccggcaact tcctgcagag ccgccccgag cccaccgccc cccccgccga gagS^ccgc 1380 
ttcgaggaga ccacccccgg ccagaagcag gagagcaagg accgcgagac cctgaccagc 1440 
ctgaagagcc tgttcggcaa cgaccccctg agccagtaa ^ 1479 

<210> 4 
<211> 1509 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic Gag 
of HIV strain AF110967 



<400> 4 

atgggcgccc gcgccagcat cctgcgcggc 
ctgcgccccg gcggcaagaa gcactacatg 
ctggagggct tcgccctgaa ccccggcctg 
atgaagcagc tgcagcccgc cctgcagacc 
accgtggcca ccctgtactg cgtgcacgcc 
ctggacaaga tcgaggagga gcagaacaag 
gccgacggca aggtgagcca gaactacccc 
caccaggcca tcagcccccg caccctgaac 
ttcagccccg aggtgatccc catgttcacc 
ctgaacacga tgttgaacac cgtgggcggc 
accatcaacg aggaggccgc cgagtgggac 
gcccccggcc agatgcgcga cccccgcggc 
caggagcaga M:cgcctggat gaccagcaac 
cggtggatca tcctgggcct gaacaagatc 
gacatccgcc agggccccaa ggagcccttc 
ctgcgcgccg agcaggccac ccaggacgtg 
cagaacgcca accccgactg caagaccatc 
gaggagatga tgaccgcctg ccagggcgtg 
gccgaggcga tgagccaggc caacagcgtg 
ggcccccggc gcaacgtcaa gtgcttcaac 
tgccgcgccc cccgcaagaa gggctgctgg 
gactgcaccg agcgccaggc caacttcctg 
cccggcaact tcctgcagaa ccgcagcgag 
cccgccgaga gcttccgctt cgaggagacc 
cgcgagccct accgcgagcc cctgaccgcc 
agccagtaa 



gagaagctgg acaagtggga gaagatccgc 60 
ctgaagcacc tggtgtgggc cagccgcgag 12 0 
ctggagaccg ccgagggctg caagcagatc 18 0 
ggcaccgagg agctgcgcag cctgtacaac 24 0 
ggcatcgagg tccgcgacac caaggaggcc 3 00 
tcccagcaga agacccagca ggccaaggag 3 60 
atcgtgcaga acctgcaggg ccagatggtg 42 0 
gcctgggtga aggtgatcga ggagaaggcc 4 80 
gccctgagcg agggcgccac cccccaggac 54 0 
caccaggccg ccatgcagat gctgaaggac 600 
cgcctgcacc ccgtgcaggc cggccccgtg 660 
agcgacatcg ccggcgccac cagcaccctg 72 0 
ccccccgtgc ccgtgggcga catctacaag 780 
gtgcggatgt acagccccgt gagcatcctg 84 0 
cgcgactacg tggaccgctt cttcaagacc 900 
aagaactgga tgaccgagac cctgctggtg 960 
ctgcgcgctc tcggccccgg cgccaccctg 102 0 
ggcggccccg gccacaaggc ccgcgtgctg 108 0 
aacatcatga tgcagaagag caacttcaag 114 0 
tgcggcaagg agggccacat cgccaagaac 12 0 0 
aagtgcggca aggagggcca ccagatgaag 1260 
ggcaagatct ggcccagcca caagggccgc 1320 
cccgccgccc ccaccgtgcc caccgccccc 13 80 
acccccgccc ccaagcagga gcccaaggac 1440 
ctgcgcagcc tgttcggcag cggccccctg 1500 

1509 



2 




<210> 5 
<211> 141 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Env common 
region of HIV strain AF110968 

<400> 5 

accatcacca tcacctgccg catcaagcag atcatcaaca tgtggcagaa ggtgggccgc 60 
gccatgtacg ccccccccat cgccggcaac ctgacctgcg agagcaacat caccggcctg 12 0 
ctgctgaccc gcgacggcgg c 141 

<210> 6 
<211> 1431 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
gpl20 coding region of HIV strain AF110968 



<400> 6 

agcgtggtgg gcaacctgtg ggtgaccgtg tactacggcg tgcccgtgtg gaaggaggcc 60 
aagaccaccc tgttctgcac cagcgacgcc aaggcctacg agaccgaggt gcacaacgtg 12 0 
tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcgt gctggagaac 180 
gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 24 0 
atcagcctgt gggaccagag cctgaagccc tgcgtgaagc tgacccccct gtgcgtgacc 3 00 
ctgaagtgcc gcaacgtgaa cgccaccaac aacatcaaca gcatgatcga caacagcaac 360 
aagggcgaga tgaagaactg cagcttcaac gtgaccaccg agctgcgcga ccgcaagcag 42 0 
gaggtgcacg ccctgttcta ccgcctggac gtggtgcccc tgcagggcaa caacagcaac 480 
gagtaccgcc tgatcaactg caacaccagc gccatcaccc aggcctgccc caaggtgagc 540 
ttcgacccca tccccatcca ctactgcacc cccgccggct acgccatcct gaagtgcaac 600 
aaccagacct tcaacggcac cggcccctgc aacaacgtga gcagcgtgca gtgcgcccac 660 
ggcatcaagc ccgtggtgag cacccagctg ctgctgaacg gcagcctggc caagggcgag 72 0 
atcatcatcc gcagcgagaa cctggccaac aacgccaaga tcatcatcgt gcagctgaac 780 
aagcccgtga agatcgtgtg cgtgcgcccc aacaacaaca cccgcaagag cgtgcgcatc 84 0 
ggccccggcc agaccttcta cgccaccggc gagatcatcg gcgacatccg ccaggcctac 900 
tgcatcatca acaagaccga gtggaacagc accctgcagg gcgtgagcaa gaagctggag 960 
gagcacttca gcaagaaggc catcaagttc gagcccagca gcggcggcga cctggagatc 102 0 
accacccaca gcttcaactg ccgcggcgag ttcttctact gcgacaccag ccagctgttc 1080 
aacagcacct acagccccag cttcaacggc accgagaaca agctgaacgg caccatcacc 1140 
atcacctgcc gcatcaagca gatcatcaac atgtggcaga aggtgggccg cgccatgtac 1200 
gcccccccca tcgccggcaa cctgacctgc gagagcaaca tcaccggcct gctgctgacc 1260 
cgcgacggcg gcaagaccgg ccccaacgac accgagatct tccgccccgg cggcggcgac 1320 
atgcgcgaca actggcgcaa cgagctgtac aagtacaagg tggtggagat caagcccctg 13 80 
ggcgtggccc ccaccgaggc caagcgccgc gtggtggagc gcgagaagcg c 1431 

<210> 7 
<211> 1944 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: synthetic 
gpl40 coding region of HIV strain AF110968 

<400> 7 

agcgtggtgg gcaacctgtg ggtgaccgtg tactacggcg tgcccgtgtg gaaggaggcc 60 
aagaccaccc tgttctgcac cagcgacgcc aaggcctacg agaccgaggt gcacaacgtg 12 0 
tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcgt gctggagaac 180 
gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 24 0 
atcagcctgt gggaccagag cctgaagccc tgcgtgaagc tgacccccct gtgcgtgacc 300 
ctgaagtgcc gcaacgtgaa cgccaccaac aacatcaaca gcatgatcga caacagcaac 360 
aagggcgaga tgaagaactg cagcttcaac gtgaccaccg agctgcgcga ccgcaagcag 42 0 
gaggtgcacg ccctgttcta ccgcctggac gtggtgcccc tgcagggcaa caacagcaac 480 
gagtaccgcc tgatcaactg caacaccagc gccatcaccc aggcctgccc caaggtgagc 540 
ttcgacccca tccccatcca ctactgcacc cccgccggct acgccatcct gaagtgcaac 600 
aaccagacct tcaacggcac cggcccctgc aacaacgtga gcagcgtgca gtgcgcccac 660 
ggcatcaagc ccgtggtgag cacccagctg ctgctgaacg gcagcctggc caagggcgag 72 0 
atcatcatcc gcagcgagaa cctggccaac aacgccaaga tcatcatcgt gcagctgaac 780 
aagcccgtga agatcgtgtg cgtgcgcccc aacaacaaca cccgcaagag cgtgcgcatc 840 
ggccccggcc agaccttcta cgccaccggc gagatcatcg gcgacatccg ccaggcctac 900 
tgcatcatca acaagaccga gtggaacagc accctgcagg gcgtgagcaa gaagctggag 960 
gagcacttca gcaagaaggc catcaagttc gagcccagca gcggcggcga cctggagatc 1020 
accacccaca gcttcaactg ccgcggcgag ttcttctact gcgacaccag ccagctgttc 1080 
aacagcacct acagccccag cttcaacggc accgagaaca agctgaacgg caccatcacc 1140 
atcacctgcc gcatcaagca gatcatcaac atgtggcaga aggtgggccg cgccatgtac 12 00 
gcccccccca tcgccggcaa cctgacctgc gagagcaaca tcaccggcct gctgctgacc 1260 
cgcgacggcg gcaagaccgg ccccaacgac accgagatct tccgccccgg cggcggcgac 13 2 0 
atgcgcgaca actggcgcaa cgagctgtac aagtacaagg tggtggagat caagcccctg 13 80 
ggcgtggccc ccaccgaggc caagcgccgc gtggtggagc gcgagaagcg cgccgtgggc 1440 
atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 15 00 
atcaccctga ccgtgcaggc ccgcctgctg ctgagcggca tcgtgcagca gcagaacaac 1560 
ctgctgcgcg ccatcgaggc ccagcagcac ctgctgcagc tgaccgtgtg gggcatcaag 162 0 
cagctgcaga cccgcatcct ggccgtggag cgctacctga aggaccagca gctgctgggc 1680 
atctggggct gcagcggcaa gctgatctgc accaccgccg tgccctggaa cagcagctgg 1740 
agcaaccgca gccacgacga gatctgggac aacatgacct ggatgcagtg ggaccgcgag 1800 
atcaacaact acaccgacac catctaccgc ctgctggagg agagccagaa ccagcaggag 1860 
aagaacgaga aggacctgct ggccctggac agctggcaga acctgtggaa ctggttcagc 192 0 
atcaccaact ggctgtggta catc 1944 

<210> 8 
<211> 2466 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
gpl60 coding region of HIV strain AF110968 



<400> 8 

agcgtggtgg gcaacctgtg ggtgaccgtg 
aagaccaccc tgttctgcac cagcgacgcc 
tgggccaccc acgcctgcgt gcccaccgac 
gtgaccgaga acttcaacat gtggaagaac 
atcagcctgt gggaccagag cctgaagccc 



tactacggcg tgcccgtgtg gaaggaggcc 60 
aaggcctacg agaccgaggt gcacaacgtg 12 0 
cccaaccccc aggagatcgt gctggagaac 180 
gacatggtgg accagatgca cgaggacatc 24 0 
tgcgtgaagc tgacccccct gtgcgtgacc 300 
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ctgaagtgcc gcaacgtgaa cgccaccaac aacatcaaca gcatgatcga caacagcaac 3 60 
aagggcgaga tgaagaactg cagcttcaac gtgaccaccg agctgcgcga ccgcaagcag 420 
gaggtgcacg ccctgttcta ccgcctggac gtggtgcccc tgcagggcaa caacagcaac 4 80 
gagtaccgcc tgatcaactg caacaccagc gccatcaccc aggcctgccc caaggtgagc 540 
ttcgacccca tccccatcca ctactgcacc cccgccggct acgccatcct gaagtgcaac 600 
aaccagacct tcaacggcac cggcccctgc aacaacgtga gcagcgtgca gtgcgcccac 660 
ggcatcaagc ccgtggtgag cacccagctg ctgctgaacg gcagcctggc caagggcgag 720 
atcatcatcc gcagcgagaa cctggccaac aacgccaaga tcatcatcgt gcagctgaac 780 
aagcccgtga agatcgtgtg cgtgcgcccc aacaacaaca cccgcaagag cgtgcgcatc 840 
ggccccggcc agaccttcta cgccaccggc gagatcatcg gcgacatccg ccaggcctac 900 
tgcatcatca acaagaccga gtggaacagc accctgcagg gcgtgagcaa gaagctggag 960 
gagcacttca gcaagaaggc catcaagttc gagcccagca gcggcggcga cctggagatc 102 0 
accacccaca gcttcaactg ccgcggcgag ttcttctact gcgacaccag ccagctgttc 1080 
aacagcacct acagccccag cttcaacggc accgagaaca agctgaacgg caccatcacc 1140 
atcacctgcc gcatcaagca gatcatcaac atgtggcaga aggtgggccg cgccatgtac 1200 
gcccccccca tcgccggcaa cctgacctgc gagagcaaca tcaccggcct gctgctgacc 1260 
cgcgacggcg gcaagaccgg ccccaacgac accgagatct tccgccccgg cggcggcgac 132 0 
atgcgcgaca actggcgcaa cgagctgtac aagtacaagg tggtggagat caagcccctg 1380 
ggcgtggccc ccaccgaggc caagcgccgc gtggtggagc gcgagaagcg cgccgtgggc 144 0 
atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 15 0 0 
atcaccctga ccgtgcaggc ccgcctgctg ctgagcggca tcgtgcagca gcagaacaac 1560 
ctgctgcgcg ccatcgaggc ccagcagcac ctgctgcagc tgaccgtgtg gggcatcaag 162 0 
cagctgcaga cccgcatcct ggccgtggag cgctacctga aggaccagca gctgctgggc 1680 
atctggggct gcagcggcaa gctgatctgc accaccgccg tgccctggaa cagcagctgg 1740 
agcaaccgca gccacgacga gatctgggac aacatgacct ggatgcagtg ggaccgcgag 1800 
atcaacaact acaccgacac catctaccgc ctgctggagg agagccagaa ccagcaggag 18 60 
aagaacgaga aggacctgct ggccctggac agctggcaga acctgtggaa ctggttcagc 192 0 
atcaccaact ggctgtggta catcaagatc ttcatcatga tcgtgggcgg cctgatcggc 1980 
ctgcgcatca tcttcgccgt gctgagcatc gtgaaccgcg tgcgccaggg ctacagcccc 2040 
ctgcccttcc agaccctgac ccccaacccc cgcgagcccg accgcctggg ccgcatcgag 2100 
gaggagggcg gcgagcagga ccgcggccgc agcatccgcc tggtgagcgg cttcctggcc 2160 
ctggcctggg acgacctgcg cagcctgtgc ctgttcagct accaccgcct gcgcgacttc 2220 
atcctgatcg ccgcccgcgt gctggagctg ctgggccagc gcggctggga ggccctgaag 2280 
tacctgggca gcctggtgca gtactggggc ctggagctga agaagagcgc catcagcctg 2340 
ctggacacca tcgccatcgc cgtggccgag ggcaccgacc gcatcatcga gttcatccag 2400 
cgcatctgcc gcgccatccg caacatcccc cgccgcatcc gccagggctt cgaggccgcc 2460 
^^^^^^ 2466 

<210> 9 
<211> 2547 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
signal sequence and gpl60 coding region of HIV 
strain AF110968 



<400> 9 

atgcgcgtga tgggcatcct gaagaactac 
ttctggatgc tgatcatcag cagcgtggtg 
gtgcccgtgt ggaaggaggc caagaccacc 
gagaccgagg tgcacaacgt gtgggccacc 
caggagatcg tgctggagaa cgtgaccgag 



cagcagtggt ggatgtgggg catcctgggc 60 
ggcaacctgt gggtgaccgt gtactacggc 12 0 
ctgttctgca ccagcgacgc caaggcctac 180 
cacgcctgcg tgcccaccga ccccaacccc 240 
aacttcaaca tgtggaagaa cgacatggtg 3 00 



5 



gaccagatgc acgaggacat catcagcctg tgggaccaga gcctgaagcc ctgcgtgaag 360 
ctgacccccc tgtgcgtgac cctgaagtgc cgcaacgtga acgccaccaa caacatcaac 42 0 
agcatgatcg acaacagcaa caagggcgag atgaagaact gcagcttcaa cgtgaccacc 480 
gagctgcgcg accgcaagca ggaggtgcac gccctgttct accgcctgga cgtggtgccc 540 
ctgcagggca acaacagcaa cgagtaccgc ctgatcaact gcaacaccag cgccatcacc 600 
caggcctgcc ccaaggtgag cttcgacccc atccccatcc actactgcac ccccgccggc 660 
tacgccatcc tgaagtgcaa caaccagacc ttcaacggca ccggcccctg caacaacgtg 72 0 
agcagcgtgc agtgcgccca cggcatcaag cccgtggtga gcacccagct gctgctgaac 780 
ggcagcctgg ccaagggcga gatcatcatc cgcagcgaga acctggccaa caacgccaag 84 0 
atcatcatcg tgcagctgaa caagcccgtg aagatcgtgt gcgtgcgccc caacaacaac 900 
acccgcaaga gcgtgcgcat cggccccggc cagaccttct acgccaccgg cgagatcatc 960 
ggcgacatcc gccaggccta ctgcatcatc aacaagaccg agtggaacag caccctgcag 1020 
ggcgtgagca agaagctgga ggagcacttc agcaagaagg ccatcaagtt cgagcccagc 1080 
agcggcggcg acctggagat caccacccac agcttcaact gccgcggcga gttcttctac 1140 
tgcgacacca gccagctgtt caacagcacc tacagcccca gcttcaacgg caccgagaac 12 00 
aagctgaacg gcaccatcac catcacctgc cgcatcaagc agatcatcaa catgtggcag 12 60 
aaggtgggcc gcgccatgta cgcccccccc atcgccggca acctgacctg cgagagcaac 1320 
atcaccggcc tgctgctgac ccgcgacggc ggcaagaccg gccccaacga caccgagatc 1380 
ttccgccccg gcggcggcga catgcgcgac aactggcgca acgagctgta caagtacaag 1440 
gtggtggaga tcaagcccct gggcgtggcc cccaccgagg ccaagcgccg cgtggtggag 1500 
cgcgagaagc gcgccgtggg catcggcgcc gtgttcctgg gcttcctggg cgccgccggc 1560 
agcaccatgg gcgccgccag catcaccctg accgtgcagg cccgcctgct gctgagcggc 1620 
atcgtgcagc agcagaacaa cctgctgcgc gccatcgagg cccagcagca cctgctgcag 1680 
ctgaccgtgt ggggcatcaa gcagctgcag acccgcatcc tggccgtgga gcgctacctg 1740 
aaggaccagc agctgctggg catctggggc tgcagcggca agctgatctg caccaccgcc 1800 
gtgccctgga acagcagctg gagcaaccgc agccacgacg agatctggga caacatgacc 1860 
tggatgcagt gggaccgcga gatcaacaac tacaccgaca ccatctaccg cctgctggag 1920 
gagagccaga accagcagga gaagaacgag aaggacctgc tggccctgga cagctggcag 1980 
aacctgtgga actggttcag catcaccaac tggctgtggt acatcaagat cttcatcatg 2040 
atcgtgggcg gcctgatcgg cctgcgcatc atcttcgccg tgctgagcat cgtgaaccgc 2100 
gtgcgccagg gctacagccc cctgcccttc cagaccctga cccccaaccc ccgcgagccc 2160 
gaccgcctgg gccgcatcga ggaggagggc ggcgagcagg accgcggccg cagcatccgc 2220 
ctggtgagcg gcttcctggc cctggcctgg gacgacctgc gcagcctgtg cctgttcagc 2280 
taccaccgcc tgcgcgactt catcctgatc gccgcccgcg tgctggagct gctgggccag 2340 
cgcggctggg aggccctgaa gtacctgggc agcctggtgc agtactgggg cctggagctg 2400 
aagaagagcg ccatcagcct gctggacacc atcgccatcg ccgtggccga gggcaccgac 2460 
cgcatcatcg agttcatcca gcgcatctgc cgcgccatcc gcaacatccc ccgccgcatc 252 0 
cgccagggct tcgaggccgc cctgcag 2547 

<210> 10 
<211> 1035 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic a 
gp41 coding region of HIV strain AF110968 

<400> 10 

gccgtgggca tcggcgccgt gttcctgggc ttcctgggcg ccgccggcag -caccatgggc 60 
gccgccagca tcaccctgac cgtgcaggcc cgcctgctgc tgagcggcat cgtgcagcag 120 
cagaacaacc tgctgcgcgc catcgaggcc cagcagcacc tgctgcagct gaccgtgtgg 180 
ggcatcaagc agctgcagac ccgcatcctg gccgtggagc gctacctgaa ggaccagcag 240 
ctgctgggca tctggggctg cagcggcaag ctgatctgca ccaccgccgt gccctggaac 300 
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agcagctgga gcaaccgcag ccacgacgag atctgggaca acatgacctg gatgcagtgg 360 
gaccgcgaga tcaacaacta caccgacacc atctaccgcc tgctggagga gagccagaac 42 0 
cagcaggaga agaacgagaa ggacctgctg gccctggaca gctggcagaa cctgtggaac 4 80 
tggttcagca tcaccaactg gctgtggtac atcaagatct tcatcatgat cgtgggcggc 540 
ctgatcggcc tgcgcatcat cttcgccgtg ctgagcatcg tgaaccgcgt gcgccagggc 600 
tacagccccc tgcccttcca gaccctgacc cccaaccccc gcgagcccga ccgcctgggc 660 
cgcatcgagg aggagggcgg cgagcaggac cgcggccgca gcatccgcct ggtgagcggc 72 0 
ttcctggccc tggcctggga cgacctgcgc agcctgtgcc tgttcagcta ccaccgcctg 780 
cgcgacttca tcctgatcgc cgcccgcgtg ctggagctgc tgggccagcg cggctgggag 840 
gccctgaagt acctgggcag cc tggtgcag tactggggcc tggagctgaa gaagagcgcc 900 
atcagcctgc tggacaccat cgccatcgcc gtggccgagg gcaccgaccg catcatcgag 960 
ttcatccagc gcatctgccg cgccatccgc aacatccccc gccgcatccg ccagggcttc 1020 
gaggccgccc tgcag 1035 



<210> 11 

<211> 144 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic Env 
common region of HIV strain AF110975 



<400> 11 

agcatcatca ccctgccctg ccgcatcaag cagatcatcg acatgtggca gaaggtgggc 60 
cgcgccatct acgccccccc catcgagggc aacatcacct gcagcagcag catcaccggc 120 
ctgctgctgg cccgcgacgg cggc 144 

<210> 12 
<211> 1437 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic 
gpl20 coding region of HIV strain AF110975 



<400> 12 

agcggcctgg gcaacctgtg ggtgaccgtg tacgacggcg tgcccgtgtg gcgcgaggcc 60 
agcaccaccc tgttctgcgc cagcgacgcc aaggcctacg agaaggaggt gcacaacgtg 12 0 
tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcga gctggacaac 180 
gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240 
atcagcctgt gggaccagag cctgaagccc cgcgtgaagc tgacccccct gtgcgtgacc 3 00 
ctgaagtgca ccaactacag caccaactac agcaacacca tgaacgccac cagctacaac 3 60 
aacaacacca ccgaggagat caagaactgc accttcaaca tgaccaccga gctgcgcgac 420 
aagaagcagc aggtgtacgc cctgttctac aagctggaca tcgtgcccct gaacagcaac 4 80 
agcagcgagt accgcctgat caactgcaac accagcgcca tcacccaggc ctgccccaag 540 
gtgagcttcg accccatccc catccactac tgcgcccccg ccggctacgc catcctgaag 600 
tgcaagaaca acaccagcaa cggcaccggc ccctgccaga acgtgagcac cgtgcagtgc 660 
acccacggca tcaagcccgt ggtgagcacc cccctgctgc tgaacggcag cctggccgag 720 
ggcggcgaga tcatcatccg cagcaagaac ctgagcaaca acgcctacac catcatcgtg 780 
cacctgaacg acagcgtgga gatcgtgtgc acccgcccca acaacaacac ccgcaagggc 840 
atccgcatcg gccccggcca gaccttctac gccaccgaga acatcatcgg cgacatccgc 900 
caggcccact gcaacatcag cgccggcgag tggaacaagg ccgtgcagcg cgtgagcgcc 960 
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aagctgcgcg agcacttccc caacaagacc atcgagttcc agcccagcag cggcggcgac 102 0 
ctggagatca ccacccacag cttcaactgc cgcggcgagt tcttctactg caacaccagc 1080 
aagctgttca acagcagcta caacggcacc agctaccgcg gcaccgagag caacagcagc 114 0 
atcatcaccc tgccctgccg catcaagcag atcatcgaca tgtggcagaa ggtgggccgc 1200 
gccatctacg ccccccccat cgagggcaac atcacctgca gcagcagcat caccggcctg 1260 
ctgctggccc gcgacggcgg cctggacaac atcaccaccg agatcttccg cccccagggc 132 0 
ggcgacatga aggacaactg gcgcaacgag ctgtacaagt acaaggtggt ggagatcaag 138 0 
cccctgggcg tggcccccac cgaggccaag cgccgcgtgg tggagcgcga gaagcgc 143 7 

<210> 13 
<211> 1950 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
gpl40 coding region of HIV strain AF110975 

<400> 13 

agcggcctgg gcaacctgtg ggtgaccgtg tacgacggcg tgcccgtgtg gcgcgaggcc 60 
agcaccaccc tgttctgcgc cagcgacgcc aaggcctacg agaaggaggt gcacaacgtg 120 
tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcga gctggacaac 180 
gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240 
atcagcctgt gggaccagag cctgaagccc cgcgtgaagc tgacccccct gtgcgtgacc 3 00 
ctgaagtgca ccaactacag caccaactac agcaacacca tgaacgccac cagctacaac 3 SO 
aacaacacca ccgaggagat caagaactgc accttcaaca tgaccaccga gctgcgcgac 42 0 
aagaagcagc aggtgtacgc cctgttctac aagctggaca tcgtgcccct gaacagcaac 480 
agcagcgagt accgcctgat caactgcaac accagcgcca tcacccaggc ctgccccaag 540 
gtgagcttcg accccatccc catccactac tgcgcccccg ccggctacgc catcctgaag 600 
tgcaagaaca acaccagcaa cggcaccggc ccctgccaga acgtgagcac cgtgcagtgc 650 
acccacggca tcaagcccgt ggtgagcacc cccctgctgc tgaacggcag cctggccgag 72 0 
ggcggcgaga tcatcatccg cagcaagaac ctgagcaaca acgcctacac catcatcgtg 780 
cacctgaacg acagcgtgga gatcgtgtgc acccgcccca acaacaacac ccgcaagggc 84 0 
atccgcatcg gccccggcca gaccttctac gccaccgaga acatcatcgg cgacatccgc 900 
caggcccact gcaacatcag cgccggcgag tggaacaagg ccgtgcagcg cgtgagcgcc 960 
aagctgcgcg agcacttccc caacaagacc atcgagttcc agcccagcag cggcggcgac 102 0 
ctggagatca ccacccacag cttcaactgc cgcggcgagt tcttctactg caacaccagc 1080 
aagctgttca acagcagcta caacggcacc agctaccgcg gcaccgagag caacagcagc 1140 
atcatcaccc tgccctgccg catcaagcag atcatcgaca tgtggcagaa ggtgggccgc 12 00 
gccatctacg ccccccccat cgagggcaac atcacctgca gcagcagcat caccggcctg 1260 
ctgctggccc gcgacggcgg cctggacaac atcaccaccg agatcttccg cccccagggc 132 0 
ggcgacatga aggacaactg gcgcaacgag ctgtacaagt acaaggtggt ggagatcaag 138 0 
cccctgggcg tggcccccac cgaggccaag cgccgcgtgg tggagcgcga gaagcgcgcc 144 0 
gtgggcatcg gcgccgtgat cttcggcttc ctgggcgccg ccggcagcaa catgggcgcc 1500 
gccagcatca ccctgaccgc ccaggcccgc cagctgctga gcggcatcgt gcagcagcag 1560 
agcaacctgc tgcgcgccat cgaggcccag cagcacatgc tgcagctgac cgtgtggggc 162 0 
atcaagcagc tgcaggcccg cgtgctggcc atcgagcgct acctgaagga ccagcagctg 1680 
ctgggcatct ggggctgcag cggcaagctg atctgcacca ccaccgtgcc ctggaacagc 174 0 
agctggagca acaagaccca gggcgagatc tgggagaaca tgacctggat gcagtgggac 1800 
aaggagatca gcaactacac cggcatcatc taccgcctgc tggaggagag ccagaaccag 1860 
caggagcaga acgagaagga cctgctggcc ctggacagcc gcaacaacct gtggagctgg 1920 
ttcaacatca gcaactggct gtggtacatc a a yy xr^u 

<210> 14 
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<211> 2493 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic 
gpl60 coding region of HIV strain AF110975 

<400> 14 

agcggcctgg gcaacctgtg ggtgaccgtg tacgacggcg tgcccgtgtg gcgcgaggcc 60 
agcaccaccc tgttctgcgc cagcgacgcc aaggcctacg agaaggaggt gcacaacgtg 12 0 
tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcga gctggacaac 180 
gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 24 0 
atcagcctgt gggaccagag cctgaagccc cgcgtgaagc tgacccccct gtgcgtgacc 3 00 
ctgaagtgca ccaactacag caccaactac agcaacacca tgaacgccac cagctacaac 3 60 
aacaacacca ccgaggagat caagaactgc accttcaaca tgaccaccga gctgcgcgac 420 
aagaagcagc aggtgtacgc cctgttctac aagctggaca tcgtgcccct gaacagcaac 480 
agcagcgagt accgcctgat caactgcaac accagcgcca tcacccaggc ctgccccaag 540 
gtgagcttcg accccatccc catccactac tgcgcccccg ccggctacgc catcctgaag 600 
tgcaagaaca acaccagcaa cggcaccggc ccctgccaga acgtgagcac cgtgcagtgc 660 
acccacggca tcaagcccgt ggtgagcacc cccctgctgc tgaacggcag cctggccgag 720 
ggcggcgaga tcatcatccg cagcaagaac ctgagcaaca acgcctacac catcatcgtg 780 
cacctgaacg acagcgtgga gatcgtgtgc acccgcccca acaacaacac ccgcaagggc 84 0 
atccgcatcg gccccggcca gaccttctac gccaccgaga acatcatcgg cgacatccgc 900 
caggcccact gcaacatcag cgccggcgag tggaacaagg ccgtgcagcg cgtgagcgcc 960 
aagctgcgcg agcacttccc caacaagacc atcgagttcc agcccagcag cggcggcgac 1020 
ctggagatca ccacccacag cttcaactgc cgcggcgagt tcttctactg caacaccagc 1080 
aagctgttca acagcagcta caacggcacc agctaccgcg gcaccgagag caacagcagc 1140 
atcatcaccc tgccctgccg catcaagcag atcatcgaca tgtggcagaa ggtgggccgc 1200 
gccatctacg ccccccccat cgagggcaac atcacctgca gcagcagcat caccggcctg 12 60 
ctgctggccc gcgacggcgg cctggacaac atcaccaccg agatcttccg cccccagggc 1320 
ggcgacatga aggacaactg gcgcaacgag ctgtacaagt acaaggtggt ggagatcaag 1380 
cccctgggcg tggcccccac cgaggccaag cgccgcgtgg tggagcgcga gaagcgcgcc 1440 
gtgggcatcg gcgccgtgat cttcggcttc ctgggcgccg ccggcagcaa catgggcgcc 1500 
gccagcatca ccctgaccgc ccaggcccgc cagctgctga gcggcatcgt gcagcagcag 1560 
agcaacctgc tgcgcgccat cgaggcccag cagcacatgc tgcagctgac cgtgtggggc 1620 
atcaagcagc tgcaggcccg cgtgctggcc atcgagcgct acctgaagga ccagcagctg 1680 
ctgggcatct ggggctgcag cggcaagctg atctgcacca ccaccgtgcc ctggaacagc 1740 
agctggagca acaagaccca gggcgagatc tgggagaaca tgacctggat gcagtgggac 1800 
aaggagatca gcaactacac cggcatcatc taccgcctgc tggaggagag ccagaaccag 1860 
caggagcaga acgagaagga cctgctggcc ctggacagcc gcaacaacct gtggagctgg 192 0 
ttcaacatca gcaactggct gtggtacatc aagatcttca tcatgatcgt gggcggcctg 1980 
atcggcctgc gcatcatctt cgccgtgctg agcatcgtga accgcgtgcg ccagggctac 2 04 0 
agccccctga gcttccagac cctgaccccc aacccccgcg gcctggaccg cctgggccgc 2100 
atcgaggagg agggcggcga gcaggaccgc gaccgcagca tccgcctggt gcagggcttc 2160 
ctggccctgg cctgggacga cctgcgcagc ctgtgcctgt tcagctacca ccgcctgcgc 2220 
gacctgatcc tggtgaccgc ccgcgtggtg gagctgctgg gccgcagcag cccccgcggc 2280 
ctgcagcgcg gctgggaggc cctgaagtac ctgggcagcc tggtgcagta ctggggcctg 2340 
gagctgaaga agagcgccac cagcctgctg gacagcatcg ccatcgccgt ggccgagggc 2400 
accgaccgca tcatcgaggt gatccagcgc atctaccgcg ccttctgcaa catcccccgc 2460 
cgcgtgcgcc agggcttcga ggccgccctg cag 24 93 

<210> 15 
<211> 2565 




<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
signal sequence and gpl60 coding region of HIV 
strain AF110975 

<400> 15 

atgcgcgtgc gcggcatcct gcgcagctgg cagcagtggt ggatctgggg catcctgggc 60 
ttctggatct gcagcggcct gggcaacctg tgggtgaccg tgtacgacgg cgtgcccgtg 120 
tggcgcgagg ccagcaccac cctgttctgc gccagcgacg ccaaggccta cgagaaggag 180 
gtgcacaacg tgtgggccac ccacgcctgc gtgcccaccg accccaaccc ccaggagatc 240 
gagctggaca acgtgaccga gaacttcaac atgtggaaga acgacatggt ggaccagatg 300 
cacgaggaca tcatcagcct gtgggaccag agcctgaagc cccgcgtgaa gctgaccccc 360 
ctgtgcgtga ccctgaagtg caccaactac agcaccaact acagcaacac catgaacgcc 420 
accagctaca acaacaacac caccgaggag atcaagaact gcaccttcaa catgaccacc 480 
gagctgcgcg acaagaagca gcaggtgtac gccctgttct acaagctgga catcgtgccc 54 0 
ctgaacagca acagcagcga gtaccgcctg atcaactgca acaccagcgc catcacccag 600 
gcctgcccca aggtgagctt cgaccccatc cccatccact actgcgcccc cgccggctac 660 
gccatcctga agtgcaagaa caacaccagc aacggcaccg gcccctgcca gaacgtgagc 72 0 
accgtgcagt gcacccacgg catcaagccc gtggtgagca cccccctgct gctgaacggc 780 
agcctggccg agggcggcga gatcatcatc cgcagcaaga acctgagcaa caacgcctac 84 0 
accatcatcg tgcacctgaa cgacagcgtg gagatcgtgt gcacccgccc caacaacaac 900 
acccgcaagg gcatccgcat cggccccggc cagaccttct acgccaccga gaacatcatc 960 
ggcgacatcc gccaggccca ctgcaacatc agcgccggcg agtggaacaa ggccgtgcag 1020 
cgcgtgagcg ccaagctgcg cgagcacttc cccaacaaga ccatcgagtt ccagcccagc 1080 
agcggcggcg acctggagat caccacccac agcttcaact gccgcggcga gttcttctac 1140 
tgcaacacca gcaagctgtt caacagcagc tacaacggca ccagctaccg cggcaccgag 12 00 
agcaacagca gcatcatcac cctgccctgc cgcatcaagc agatcatcga catgtggcag 12 60 
aaggtgggcc gcgccatcta cgcccccccc atcgagggca acatcacctg cagcagcagc 1320 
atcaccggcc tgctgctggc ccgcgacggc ggcctggaca acatcaccac cgagatcttc 13 80 
cgcccccagg gcggcgacat gaaggacaac tggcgcaacg agctgtacaa gtacaaggtg 144 0 
gtggagatca agcccctggg cgtggccccc accgaggcca agcgccgcgt ggtggagcgc 1500 
gagaagcgcg ccgtgggcat cggcgccgtg atcttcggct tcctgggcgc cgccggcagc 1560 
aacatgggcg ccgccagcat caccctgacc gcccaggccc gccagctgct gagcggcatc 1620 
gtgcagcagc agagcaacct gctgcgcgcc atcgaggccc agcagcacat gctgcagctg 1680 
accgtgtggg gcatcaagca gctgcaggcc cgcgtgctgg ccatcgagcg ctacctgaag 1740 
gaccagcagc tgctgggcat ctggggctgc agcggcaagc tgatctgcac caccaccgtg 1800 
ccctggaaca gcagctggag caacaagacc cagggcgaga tctgggagaa catgacctgg 1860 
atgcagtggg acaaggagat cagcaactac accggcatca tctaccgcct gctggaggag 192 0 
agccagaacc agcaggagca gaacgagaag gacctgctgg ccctggacag ccgcaacaac 198 0 
ctgtggagct ggttcaacat cagcaactgg ctgtggtaca tcaagatctt catcatgatc 2 040 
gtgggcggcc tgatcggcct gcgcatcatc ttcgccgtgc tgagcatcgt gaaccgcgtg 2100 
cgccagggct acagccccct gagcttccag accctgaccc ccaacccccg cggcctggac 2160 
cgcctgggcc gcatcgagga ggagggcggc gagcaggacc gcgaccgcag catccgcctg 2220 
gtgcagggct tcctggccct ggcctgggac gacctgcgca gcctgtgcct gttcagctac 2280 
caccgcctgc gcgacctgat cctggtgacc gcccgcgtgg tggagctgct gggccgcagc 2340 
agcccccgcg gcctgcagcg cggctgggag gccctgaagt acctgggcag cctggtgcag 24 0 0 
tactggggcc tggagctgaa gaagagcgcc accagcctgc tggacagcat cgccatcgcc 2460 
gtggccgagg gcaccgaccg catcatcgag gtgatccagc gcatctaccg cgccttctgc 2 52 0 
aacatccccc gccgcgtgcg ccagggcttc gaggccgccc tgcag 2565 

<210> 16 
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<211> 1056 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic a 
gp41 coding region of HIV strain AF110975 



<400> 16 

gccgtgggca tcggcgccgt gatcttcggc 
gccgccagca tcaccctgac cgcccaggcc 
cagagcaacc tgctgcgcgc catcgaggcc 
ggcatcaagc agctgcaggc ccgcgtgctg 
ctgctgggca tctggggctg cagcggcaag 
agcagctgga gcaacaagac ccagggcgag 
gacaaggaga tcagcaacta caccggcatc 
cagcaggagc agaacgagaa ggacctgctg 
tggttcaaca tcagcaactg gctgtggtac 
ctgatcggcc tgcgcatcat cttcgccgtg 
tacagccccc tgagcttcca gaccctgacc 
cgcatcgagg aggagggcgg cgagcaggac 
ttcctggccc tggcctggga cgacctgcgc 
cgcgacctga tcctggtgac cgcccgcgtg 
ggcctgcagc gcggctggga ggccctgaag 
ctggagctga agaagagcgc caccagcctg 
ggcaccgacc gcatcatcga ggtgatccag 
cgccgcgtgc gccagggctt cgaggccgcc 



ttcctgggcg ccgccggcag caacatgggc 60 
cgccagctgc tgagcggcat cgtgcagcag 12 0 
cagcagcaca tgctgcagct gaccgtgtgg 180 
gccatcgagc gctacctgaa ggaccagcag 240 
ctgatctgca ccaccaccgt gccctggaac 300 
atctgggaga acatgacctg gatgcagtgg 3 60 
atctaccgcc tgctggagga gagccagaac 420 
gccctggaca gccgcaacaa cctgtggagc 480 
atcaagatct tcatcatgat cgtgggcggc 54 0 
ctgagcatcg tgaaccgcgt gcgccagggc 600 
cccaaccccc gcggcctgga ccgcctgggc 660 
cgcgaccgca gcatccgcct ggtgcagggc 72 0 
agcctgtgcc tgttcagcta ccaccgcctg 780 
gtggagctgc tgggccgcag cagcccccgc 84 0 
tacctgggca gcctggtgca gtactggggc 900 
ctggacagca tcgccatcgc cgtggccgag 960 
cgcatctacc gcgccttctg caacatcccc 1020 
ctgcag 1056 



<210> 17 
<211> 492 
<212> PRT 

<213> Human immunodeficiency virus 



<400> 17 

Met Glv Ala Arg Ala Ser He Leu Arg Gly Gly Lys Leu Asp Ala Trp 
15 10 15 

Glu Arg He Arg Leu Arg Pro Gly Gly Lys Lys Cys Tyr Met Met Lys 
20 25 30 

His Leu Val Trp Ala Ser Arg Glu Leu Glu Lys Phe Ala Leu Asn Pro 
35 40 45 

Gly Leu Leu Glu Thr Ser Glu Gly Cys Lys Gin He He Arg Gin Leu 
50 55 60 

His Pro Ala Leu Gin Thr Gly Ser Glu Glu Leu Lys Ser Leu Phe Asn 
65 70 75 80 

Thr Val Ala Thr Leu Tyr Cys Val His Glu Lys He Glu Val Arg Asp 
85 90 ^5 

Thr Lys Glu Ala Leu Asp Lys He Glu Glu Glu Gin Asn Lys Cys Gin 
100 105 110 
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Gin Lys He Gin Gin Ala Glu Ala Ala Asp Lys Gly Lys Val Ser Gin 
115 120 125 

Asn Tyr Pro He Val Gin Asn Leu Gin Gly Gin Met Val His Gin Ala 
130 135 140 

He Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val He Glu Glu Lys 

150 155 160 

Ala Phe Ser Pro Glu Val He Pro Met Phe Thr Ala Leu Ser Glu Gly 
165 170 175 

Ala Thr Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His 
180 185 190 

Gin Ala Ala Met Gin Met Leu Lys Asp Thr He Asn Glu Glu Ala Ala 
195 200 205 

Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro He Ala Pro Gly 
210 215 220 

Gin Met Arg Glu Pro Arg Gly Ser Asp He Ala Gly Thr Thr Ser Thr 
225 230 235 240 

Leu Gin Glu Gin He Ala Trp Met Thr Ser Asn Pro Pro He Pro Val 
245 250 255 

Gly Asp He Tyr Lys Arg Trp He He Leu Gly Leu Asn Lys He Val 
260 265 270 

Arg Met Tyr Ser Pro Val Ser He Leu Asp He Lys Gin Gly Pro Lys 
275 280 285 

Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Phe Lys Thr Leu Arg Ala 
290 295 300 

Glu Gin Ser Thr Gin Glu Val Lys Asn Trp Met Thr Asp Thr Leu Leu 
305 310 315 320 

Val Gin Asn Ala Asn Pro Asp Cys Lys Thr He Leu Arg Ala Leu Gly 
325 330 335 

Pro Gly Ala Ser Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly 
340 . 345 350 

Gly Pro Ser His Lys Ala Arg Val Leu Ala Glu Ala Met Ser Gin Ala 
355 360 365 

Asn Thr Ser Val Met Met Gin Lys Ser Asn Phe Lys Gly Pro Arg Arg 
370 375 380 

He Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His He Ala Arg Asn 

390 395 400 
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Cys Arg Ala Pro 



His Gin Met Lys 
420 

He Trp Pro Ser 
435 

Pro Glu Pro Thr 
450 

Thr Pro Gly Gin 
465 

Leu Lys Ser Leu 




Arg Lys Lys Gly 
405 

Asp Cys Thr Glu 



His Lys Gly Arg 
440 

Ala Pro Pro Ala 
455 

Lys Gin Glu Ser 
470 

Phe Gly Asn Asp 
485 



Cys Trp Lys Cys 
410 

Arg Gin Ala Asn 
425 

Pro Gly Asn Phe 



Glu Ser Phe Arg 
460 

Lys Asp Arg Glu 
475 

Pro Leu Ser Gin 
490 




Gly Lys Glu Gly 
415 

Phe Leu Gly Lys 
430 

Leu Gin Ser Arg 
445 

Phe Glu Glu Thr 



Thr Leu Thr Ser 
480 



<210> 18 
<211> 81 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
signal sequence of HIV strain AF110968 

<400> 18 

^^tgcgcgtga tgggcatcct gaagaactac cagcagtggt ggatgtgggg catcctgggc 60 
ttctggatgc tgatcatcag c 81 

<210> 19 
<211> 72 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
signal sequence of HIV strain AF110975 

<400> 19 

atgcgcgtgc gcggcatcct gcgcagctgg cagcagtggt ggatctgggg catcctgggc 60 
ttctggatct gc 72 

<210> 20 
<211> 1479 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic Gag 
coding sequence of HIV strain AF110965 



13 



• 



<400> 20 



atgggcgccc gcgccagcat cctgcgcggc ggcaagctgg acgcctggga gcgcatccgc 60 
ctgcgccccg gcggcaagaa gtgctacatg atgaagcacc tggtgtgggc cagccgcgag 12 0 
ctggagaagt tcgccctgaa ccccggcctg ctggagacca gcgagggctg caagcagatc 180 
atccgccagc tgcaccccgc cctgcagacc ggcagcgagg agctgaagag cctgttcaac 24 0 
accgtggcca ccctgtactg cgtgcacgag aagatcgagg tgcgcgacac caaggaggcc 3 00 
ctggacaaga tcgaggagga gcagaacaag tgccagcaga agatccagca ggccgaggcc 3 60 
gccgacaagg gcaaggtgag ccagaactac cccatcgtgc agaacctgca gggccagatg 42 0 
gtgcaccagg ccatcagccc ccgcaccctg aacgcctggg tgaaggtgat cgaggagaag 4 80 
gccttcagcc ccgaggtgat ccccatgttc accgccctga gcgagggcgc caccccccag 54 0 
gacctgaaca ccatgctgaa caccgtgggc ggccaccagg ccgccatgca gatgctgaag 60 0 
gacaccatca acgaggaggc cgccgagtgg gaccgcgtgc accccgtgca cgccggcccc 660 
atcgcccccg gccagatgcg cgagccccgc ggcagcgaca tcgccggcac caccagcacc 72 0 
ctgcaggagc agatcgcctg gatgaccagc aaccccccca tccccgtggg cgacatctac 780 
aagcgctgga tcatcctggg cctgaacaag atcgtgcgca tgtacagccc cgtgagcatc 84 0 
ctggacatca agcagggccc caaggagccc ttccgcgact acgtggaccg cttcttcaag 900 
accctgcgcg ccgagcagag cacccaggag gtgaagaact ggatgaccga caccctgctg 960 
gtgcagaacg ccaaccccga ctgcaagacc atcctgcgcg ccctgggccc cggcgccagc 1020 
ctggaggaga tgatgaccgc ctgccagggc gtgggcggcc ccagccacaa ggcccgcgtg 10 80 
ctggccgagg ccatgagcca ggccaacacc agcgtgatga tgcagaagag caacttcaag 1140 
ggcccccgcc gcatcgtgaa gtgcttcaac tgcggcaagg agggccacat cgcccgcaac 1200 
tgccgcgccc cccgcaagaa gggctgctgg aagtgcggca aggagggcca ccagatgaag 12 60 
gactgcaccg agcgccaggc caacttcctg ggcaagatct ggcccagcca caagggccgc 1320 
cccggcaact tcctgcagag ccgccccgag cccaccgccc cccccgccga gagcttccgc 1380 
ttcgaggaga ccacccccgg ccagaagcag gagagcaagg accgcgagac cctgaccagc 144 0 
ctgaagagcc tgttcggcaa cgaccccctg agccagtaa 1479 



<211> 1509 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic Gag 
coding sequence of HIV strain AF110967 

<400> 21 

atgggcgccc gcgccagcat cctgcgcggc gagaagctgg acaagtggga gaagatccgc 60 
ctgcgccccg gcggcaagaa gcactacatg ctgaagcacc tggtgtgggc cagccgcgag 120 
ctggagggct tcgccctgaa ccccggcctg ctggagaccg ccgagggctg caagcagatc 180 
atgaagcagc tgcagcccgc cctgcagacc ggcaccgagg agctgcgcag cctgtacaac 240 
accgtggcca ccctgtactg cgtgcacgcc ggcatcgagg tgcgcgacac caaggaggcc 3 00 
ctggacaaga tcgaggagga gcagaacaag agccagcaga agacccagca ggccaaggag 3 60 
gccgacggca aggtgagcca gaactacccc atcgtgcaga acctgcaggg ccagatggtg 42 0 
caccaggcca tcagcccccg caccctgaac gcctgggtga aggtgatcga ggagaaggcc 480 
ttcagccccg aggtgatccc catgttcacc gccctgagcg agggcgccac cccccaggac 540 
ctgaacacca tgctgaacac cgtgggcggc caccaggccg ccatgcagat gctgaaggac 600 
accatcaacg aggaggccgc cgagtgggac cgcctgcacc ccgtgcaggc cggccccgtg 660 
gcccccggcc agatgcgcga cccccgcggc agcgacatcg ccggcgccac cagcaccctg 72 0 
caggagcaga tcgcctggat gaccagcaac ccccccgtgc ccgtgggcga catctacaag 780 
cgctggatca tcctgggcct gaacaagatc gtgcgcatgt acagccccgt gagcatcctg 84 0 
gacatccgcc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 900 
ctgcgcgccg agcaggccac ccaggacgtg aagaactgga tgaccgagac cctgctggtg 960 
cagaacgcca accccgactg caagaccatc ctgcgcgccc tgggccccgg cgccaccctg 1020 



<210> 21 



14 



gaggagatga tgaccgcctg ccagggcgtg ggcggccccg gccacaaggc ccgcgtgctg 1080 
gccgaggcca tgagccaggc caacagcgtg aacatcatga tgcagaagag caacttcaag 114 0 
ggcccccgcc gcaacgtgaa gtgcttcaac tgcggcaagg agggccacat cgccaagaac 1200 
tgccgcgccc cccgcaagaa gggctgctgg aagtgcggca aggagggcca ccagatgaag 1260 
gactgcaccg agcgccaggc caacttcctg ggcaagatct ggcccagcca caagggccgc 132 0 
cccggcaact tcctgcagaa ccgcagcgag cccgccgccc ccaccgtgcc caccgccccc 13 80 
cccgccgaga gcttccgctt cgaggagacc acccccgccc ccaagcagga gcccaaggac 1440 
cgcgagccct accgcgagcc cctgaccgcc ctgcgcagcc tgttcggcag cggccccctg 1500 
agccagtaa 1509 

<210> 22 
<211> 502 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 22 

Met Gly Ala Arg Ala Ser lie Leu Arg Gly Glu Lys Leu Asp Lys Trp 
15 10 15 

Glu Lys lie Arg Leu Arg Pro Gly Gly Lys Lys His Tyr Met Leu Lys 
20 25 30 

His Leu Val Trp Ala Ser Arg Glu Leu Glu Gly Phe Ala Leu Asn Pro 
35 40 45 

Gly Leu Leu Glu Thr Ala Glu Gly Cys Lys Gin lie Met Lys Gin Leu 
50 55 60 

Gin Pro Ala Leu Gin Thr Gly Thr Glu Glu Leu Arg Ser Leu Tyr Asn 
65 70 75 80 

Thr Val Ala Thr Leu Tyr Cys Val His Ala Gly lie Glu Val Arg Asp 
85 90 95 

Thr Lys Glu Ala Leu Asp Lys lie Glu Glu Glu Gin Asn Lys Ser Gin 
100 105 110 

Gin Lys Thr Gin Gin Ala Lys Glu Ala Asp Gly Lys Val Ser Gin Asn 
115 120 125 

Tyr Pro lie Val Gin Asn Leu Gin Gly Gin Met Val His Gin Ala lie 
130 135 140 

Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val lie Glu Glu Lys Ala 
145 150 155 160 

Phe Ser Pro Glu Val lie Pro Met Phe Thr Ala Leu Ser Glu Gly Ala 
165 170 175 

Thr Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gin 
180 185 190 

Ala Ala Met Gin Met Leu Lys Asp Thr lie Asn Glu Glu Ala Ala Glu 
195 200 205 
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Trp Asp Arg Leu His Pro Val Gin Ala Gly Pro Val Ala Pro Gly Gin 
210 215 220 

Met Arg Asp Pro Arg Gly Ser Asp He Ala Gly Ala Thr Ser Thr Leu 
225 230 235 240 

Gin Glu Gin He Ala Trp Met Thr Ser Asn Pro Pro Val Pro Val Gly 
245 250 255 

Asp He Tyr Lys Arg Trp He He Leu Gly Leu Asn Lys He Val Arg 
260 265 270 

Met Tyr Ser Pro Val Ser He Leu Asp He Arg Gin Gly Pro Lys Glu 
275 280 285 

Pro Phe Arg Asp Tyr Val Asp Arg Phe Phe Lys Thr Leu Arg Ala Glu 
290 295 300 

Gin Ala Thr Gin Asp Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val 
305 310 315 320 

Gin Asn Ala Asn Pro Asp Cys Lys Thr He Leu Arg Ala Leu Gly Pro 
325 330 335 

Gly Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly 
340 345 350 

Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser Gin Ala Asn 
355 360 365 

Ser Val Asn He Met Met Gin Lys, Ser Asn Phe Lys Gly Pro Arg Arg 
370 375 380 

Asn Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His He Ala Lys Asn 
385 390 395 400 

Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly 
405 410 415 

His Gin Met Lys Asp Cys Thr Glu Arg Gin Ala Asn Phe Leu Gly Lys 
420 425 430 

He Trp Pro Ser His Lys Gly Arg Pro Gly Asn Phe Leu Gin Asn Arg 
435 440 445 

Ser Glu Pro Ala Ala Pro Thr Val Pro Thr Ala Pro Pro Ala Glu Ser 
450 455 460 

Phe Arg Phe Glu Glu Thr Thr Pro Ala Pro Lys Gin Glu Pro Lys Asp 
465 470 475 480 

Arg Glu Pro Tyr Arg Glu Pro Leu Thr Ala Leu Arg Ser Leu Phe Gly 
485 490 495 
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Ser Gly Pro Leu Ser Gin 
500 



<210> 23 
<211> 849 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 23 

Met Arg Val Met Gly lie Leu Lys Asn Tyr Gin Gin Trp Trp Met Trp 
15 10 15 

Gly He Leu Gly Phe Trp Met Leu He He Ser Ser Val Val Gly Asn 
20 25 30 

Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Lys 
35 40 45 

Thr Thr Leu Phe Cys Thr Ser Asp Ala Lys Ala Tyr Glu Thr Glu Val 
50 55 60 

His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro 
65 70 75 80 

Gin Glu He Val Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys 
85 90 95 

Asn Asp Met Val Asp Gin Met His Glu Asp He He Ser Leu Trp Asp 
100 105 110 

Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 
115 120 125 

Lys Cys Arg Asn Val Asn Ala Thr Asn Asn He Asn Ser Met He Asp 
130 135 140 

Asn Ser Asn Lys Gly Glu Met Lys Asn Cys Ser Phe Asn Val Thr Thr 
145 150 155 160 

Glu Leu Arg Asp Arg Lys Gin Glu Val His Ala Leu Phe Tyr Arg Leu 
165 170 175 

Asp Val Val Pro Leu Gin Gly Asn Asn Ser Asn Glu Tyr Arg Leu He 
180 185 190 

Asn Cys Asn Thr Ser Ala He Thr Gin Ala Cys Pro Lys Val Ser Phe 
195 200 205 

Asp Pro He Pro He His Tyr Cys Thr Pro Ala Gly Tyr Ala He Leu 
210 215 220 

Lys Cys Asn Asn Gin Thr Phe Asn Gly Thr Gly Pro Cys Asn Asn Val 
225 230 235 240 
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Ser Ser Val Gin Cys Ala His Gly He Lys Pro Val Val Ser Thr Gin 
245 250 255 

Leu Leu Leu Asn Gly Ser Leu Ala Lys Gly Glu He He He Arg Ser 
260 265 270 

Glu Asn Leu Ala Asn Asn Ala Lys He He He Val Gin Leu Asn Lys 
275 280 285 

Pro Val Lys He Val Cys Val Arg Pro Asn Asn Asn Thr Arg Lys Ser 
290 295 300 

Val Arg He Gly Pro Gly Gin Thr Phe Tyr Ala Thr Gly Glu He He 
305 310 315 320 

Gly Asp He Arg Gin Ala Tyr Cys He He Asn Lys Thr Glu Trp Asn 
325 330 335 

Ser Thr Leu Gin Gly Val Ser Lys Lys Leu Glu Glu His Phe Ser Lys 
340 345 350 

Lys Ala He Lys Phe Glu Pro Ser Ser Gly Gly Asp Leu Glu He Thr 
355 360 365 

Thr His Ser Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asp Thr Ser 
370 375 380 

Gin Leu Phe Asn Ser Thr Tyr Ser Pro Ser Phe Asn Gly Thr Glu Asn 
385 390 395 400 

Lys Leu Asn Gly Thr He Thr He Thr Cys Arg He Lys Gin He He 
405 410 415 

Asn Met Trp Gin Lys Val Gly Arg Ala Met Tyr Ala Pro Pro He Ala 
420 425 430 

Gly Asn Leu Thr Cys Glu Ser Asn He Thr Gly Leu Leu Leu Thr Arg 
435 440 445 

Asp Gly Gly Lys Thr Gly Pro Asn Asp Thr Glu He Phe Arg Pro Gly 
450 455 460 

Gly Gly Asp Met Arg Asp Asn Trp Arg Asn Glu Leu Tyr Lys Tyr Lys 
465 470 475 480 

Val Val Glu He Lys Pro Leu Gly Val Ala Pro Thr Glu Ala Lys Arg 
485 490 495 

Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly He Gly Ala Val Phe 
500 505 510 

Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser He 



515 



520 



525 
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• # 



Thr Leu Thr Val Gin Ala Arg Leu Leu Leu Ser Gly lie Val Gin Gin 
530 535 540 

Gin Asn Asn Leu Leu Arg Ala lie Glu Ala Gin Gin His Leu Leu Gin 
545 550 555 560 

Leu Thr Val Trp Gly lie Lys Gin Leu Gin Thr Arg lie Leu Ala Val 
565 570 575 

Glu Arg Tyr Leu Lys Asp Gin Gin Leu Leu Gly He Trp Gly Cys Ser 
580 585 590 

Gly Lys Leu He Cys Thr Thr Ala Val Pro Trp Asn Ser Ser Trp Ser 
595 600 605 

Asn Arg Ser His Asp Glu He Trp Asp Asn Met Thr Trp Met Gin Trp 
610 615 620 

Asp Arg Glu He Asn Asn Tyr Thr Asp Thr He Tyr Arg Leu Leu Glu 
625 630 635 640 

Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Lys Asp Leu Leu Ala Leu 
645 650 655 

Asp Ser Trp Gin Asn Leu Trp Asn Trp Phe Ser lie Thr Asn Trp Leu 
660 665 670 

Trp Tyr He Lys He Phe He Met He Val Gly Gly Leu He Gly Leu 
675 680 685 

Arg He He Phe Ala Val Leu Ser He Val Asn Arg Val Arg Gin Gly 
690 695 700 

Tyr Ser Pro Leu Pro Phe Gin Thr Leu Thr Pro Asn Pro Arg Glu Pro 
705 710 715 720 

Asp Arg Leu Gly Arg He Glu Glu Glu Gly Gly Glu Gin Asp Arg Gly 
725 730 735 

Arg Ser He Arg Leu Val Ser Gly Phe Leu Ala Leu Ala Trp Asp Asp 
740 745 750 

Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Phe He 
755 760 765 

Leu He Ala Ala Arg Val Leu Glu Leu Leu Gly Gin Arg Gly Trp Glu 
770 775 780 

Ala Leu Lys Tyr Leu Gly Ser Leu Val Gin Tyr Trp Gly Leu Glu Leu 
785 790 795 800 

Lys Lys Ser Ala He Ser Leu Leu Asp Thr He Ala He Ala Val Ala 
805 810 815 
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Glu Gly Thr Asp 
820 

lie Arg Asn lie 
835 

Gin 




Arg He He Glu 

Pro Arg Arg He 
840 



Phe He Gin Arg 
825 

Arg Gin Gly Phe 




He Cys Arg Ala 
830 

Glu Ala Ala Leu 
845 



<210> 24 
<211> 855 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 24 

Met Arg Val Arg Gly He Leu Arg Ser Trp Gin Gin Trp Trp He Trp 
15 10 15 

Gly He Leu Gly Phe Trp He Cys Ser Gly Leu Gly Asn Leu Trp Val 
20 25 30 

Thr Val Tyr Asp Gly Val Pro Val Trp Arg Glu Ala Ser Thr Thr Leu 
35 40 45 

Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu Val His Asn Val 
50 55 60 

Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gin Glu He 
65 70 75 80 

Glu Leu Asp Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asp Met 
85 90 95 

Val Asp Gin Met His Glu Asp He He Ser Leu Trp Asp Gin Ser Leu 
100 105 110 

Lys Pro Arg Val Lys Leu Thr Pro Leu Cys Val Thr Leu Lys Cys Thr 
115 120 125 

Asn Tyr Ser Thr Asn Tyr Ser Asn Thr Met Asn Ala Thr Ser Tyr Asn 
130 135 140 

Asn Asn Thr Thr Glu Glu He Lys Asn Cys • Thr Phe Asn Met Thr Thr 
145 150 155 160 

Glu Leu Arg Asp Lys Lys Gin Gin Val Tyr Ala Leu Phe Tyr Lys Leu 
165 170 175 

Asp He Val Pro Leu Asn Ser Asn Ser Ser Glu Tyr Arg Leu He Asn 
180 185 190 

Cys Asn Thr Ser Ala He Thr Gin Ala Cys Pro Lys Val Ser Phe Asp 
195 200 205 
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Pro lie Pro lie His Tyr Cys Ala Pro Ala Gly Tyr Ala He Leu Lys 
210 215 220 

Cys Lys Asn Asn Thr Ser Asn Gly Thr Gly Pro Cys Gin Asn Val Ser 
225 230 235 240 

Thr Val Gin Cys Thr His Gly He Lys Pro Val Val Ser Thr Pro Leu 
245 250 255 

Leu Leu Asn Gly Ser Leu Ala Glu Gly Gly Glu He He He Arg Ser 
260 265 270 

Lys Asn Leu Ser Asn Asn Ala Tyr Thr He He Val His Leu Asn Asp 
275 280 285 

Ser Val Glu He Val Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly 
290 295 300 

He Arg He Gly Pro Gly Gin Thr Phe Tyr Ala Thr Glu Asn He He 
305 310 315 320 

Gly Asp He Arg Gin Ala His Cys Asn He Ser Ala Gly Glu Trp Asn 



Lys Ala Val Gin Arg Val Ser Ala Lys Leu Arg Glu His Phe Pro Asn 
340 345 350 

Lys Thr He Glu Phe Gin Pro Ser Ser Gly Gly Asp Leu Glu He Thr 
355 360 365 

Thr His Ser Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn Thr Ser 
370 375 380 

Lys Leu Phe Asn Ser Ser Tyr Asn Gly Thr Ser Tyr Arg Gly Thr Glu 
385 390 395 400 

Ser Asn Ser Ser He He Thr Leu Pro Cys Arg He Lys Gin He He 
405 410 415 

Asp Met Trp Gin Lys Val Gly Arg Ala He Tyr Ala Pro Pro He Glu 
420 425 430 

Gly Asn He Thr Cys Ser Ser Ser He Thr Gly Leu Leu Leu Ala Arg 
435 440 445 

Asp Gly Gly Leu Asp Asn He Thr Thr Glu He Phe Arg Pro Gin Gly 
450 455 460 

Gly Asp Met Lys Asp Asn Trp Arg Asn Glu Leu Tyr Lys Tyr Lys Val 
465 470 475 480 

Val Glu He Lys Pro Leu Gly Val Ala Pro Thr Glu Ala Lys Arg Arg 



325 



330 



335 



485 



490 



495 
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Val Val Glu Arg 
500 

Gly Phe Leu Gly 
515 

Leu Thr Ala Gin 
530 

Ser Asn Leu Leu 
.545 

Thr Val Trp Gly 




Glu Lys Arg Ala 



Ala Ala Gly Ser 
520 

Ala Arg Gin Leu 
535 

Arg Ala lie Glu 
550 

lie Lys Gin Leu 
565 



Val Gly He Gly 
505 

Asn Met Gly Ala 



Leu Ser Gly He 
540 

Ala Gin Gin His 
555 

Gin Ala Arg Val 
570 




Ala Val He Phe 
510 

Ala Ser He Thr 
525 

Val Gin Gin Gin 



Met Leu Gin Leu 
560 

Leu Ala He Glu 
575 



Arg Tyr Leu Lys 
580 

Lys Leu He Cys 
595 

Lys Thr Gin Gly 
610 

Lys Glu He Ser 
625 

Ser Gin Asn Gin 



Ser Arg Asn Asn 
660 

Tyr He Lys He 
675 

He He Phe Ala 
690 

Ser Pro Leu Ser 
705 



Asp Gin Gin Leu 



Thr Thr Thr Val 
600 

Glu He Trp Glu 
615 

Asn Tyr Thr Gly 
630 

Gin Glu Gin Asn 
645 

Leu Trp Ser Trp 



Phe He Met He 
680 

Val Leu Ser He 
695 

Phe Gin Thr Leu 
710 



Leu Gly He Trp 
585 

Pro Trp Asn Ser 



Asn Met Thr Trp 
620 

He He Tyr Arg 
635 

Glu Lys Asp Leu 
650 

Phe Asn He Ser 
665 

Val Gly Gly Leu 



Val Asn Arg Val 
700 

Thr Pro Asn Pro 
715 



Gly Cys Ser Gly 
590 

Ser Trp Ser Asn 
605 

Met Gin Trp Asp 



Leu Leu Glu Glu 
640 
« 

Leu Ala Leu Asp 
655 

Asn Trp Leu Trp 
670 

He Gly Leu Arg 
685 

Arg Gin Gly Tyr 



Arg Gly Leu Asp 
720 



Arg Leu Gly Arg 



Ser He Arg Leu 
740 

Arg Ser Leu Cys 
755 

Val Thr Ala Arg 
770 



He Glu Glu Glu 
725 

Val Gin Gly Phe 



Leu Phe Ser Tyr 
760 

Val Val Glu Leu 
775 



Gly Gly Glu Gin 
730 

Leu Ala Leu Ala 
745 

His Arg Leu Arg 



Leu Gly Arg Ser 
780 



Asp Arg Asp Arg 
735 

Trp Asp Asp Leu 
750 

Asp Leu He Leu 
765 

Ser Pro Arg Gly 
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• # 



Leu Gin Arg Gly 
785 

Tyr Trp Gly Leu 



He Ala He Ala 
820 

Gin Arg He Tyr 
835 

Gly Phe Glu Ala 
850 



Trp Glu Ala Leu 
790 

Glu Leu Lys Lys 
805 

Val Ala Glu Gly 



Arg Ala Phe Cys 
840 

Ala Leu Gin 
855 



Lys Tyr Leu Gly 
795 

Ser Ala Thr Ser 
810 

Thr Asp Arg He 
825 

Asn He Pro Arg 



Ser Leu Val Gin 
800 

Leu Leu Asp Ser 
815 

He Glu Val He 
830 

Arg Val Arg Gin 
845 



<210> 25 
<211> 20 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 25 

Asp He Lys Gin Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg 
15 10 15 

Phe Phe Lys Thr 
20 



<210> 26 
<211> 60 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 26 

gacataaaac aaggaccaaa agagcccttt agagactatg tagaccggtt ctttaaaacc 60 

<210> 27 
<211> 20 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 27 

Asp He Arg Gin Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg 
15 10 15 

Phe Phe Lys Thr 
20 



<210> 28 

<211> 47 

<212> PRT 

<213> Human immunodeficiency virus 
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<400> 28 
Thr He Thr He 
1 

Lys Val Gly Arg 
20 

Cys Glu Ser Asn 
35 



Thr Cys Arg He 
5 

Ala Met .Tyr Ala 

He Thr Gly Leu 
40 



Lys Gin He He 
10 

Pro Pro He Ala 
25 

Leu Leu Thr Arg 



Asn Met Trp Gin 
15 

Gly Asn Leu Thr 
30 

Asp Gly Gly 
45 



<210> 29 
<211> 48 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 29 

Ser He He Thr Leu Pro Cys Arg He Lys Gin He He Asp Met Trp 
1 5 10 15 

Gin Lys Val Gly Arg Ala He Tyr Ala Pro Pro He Glu Gly Asn He 
20 25 30 

Thr Cys Ser Ser Ser He Thr Gly Leu Leu Leu Ala Arg Asp Gly Gly 
35 40 45 



<210> 30 
<211> 2469 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PR975 (+) 



<400> 30 

gtcgacgcca 

cgcagcaact 

cacatcgccc 

ggccaccaga 

ttcccccagg 

agccgcgagc 

ggcaccctga 

ggcggccaga 

atgagcctgc 

gtgcgccagt 

ctgatcggcc 

accctgaact 

gacggcccca 

atctgcgagg 

aacacccccg 

ttccgcgagc 

cccgccggcc 

agcgtgcccc 



ccatggccga 
tcaagggccc 
gcaactgccg 
tgaaggactg 
gcaaggcccg 
tgcaggtgcg 
acttccccca 
tcaaggaggc 
ccggcaagtg 
acgaccagat 
ccacccccgt 
tccccatcag 
aggtgaagca 
agatggagaa 
tgttcgccat 
tgaacaagcg 
tgaagaagaa 
tggacgagga 



ggccatgagc 
caagcgcatc 
cgccccccgc 
caccgagcgc 
cgagttcccc 
cggcgacaac 
gatcaccctg 
cctgctggac 
gaagcccaag 
cctgatcgag 
gaacatcatc 
ccccatcgag 
gtggcccctg 
ggagggcaag 
caagaagaag 
cacccaggac 
gaagagcgtg 
cttccgcaag 



caggccacca 
atcaagtgct 
aagaagggct 
caggccaact 
agcgagcaga 
ccccgcagcg 
tggcagcgcc 
accggcgccg 
atgatcggcg 
atctgcggca 
ggccgcaaca 
accgtgcccg 
accgaggaga 
atcaccaaga 
gacagcacca 
ttctgggagg 
accgtgctgg 
tacaccgcct 



gcgccaacat 
tcaactgcgg 
gctggaagtg 
tcttccgcga 
accgcgccaa 
aggccggcgc 
ccctggtgag 
acgacaccgt 
gcatcggcgg 
agaaggccat 
tgctgaccca 
tgaagctgaa 
agatcaaggc 
tcggccccga 
agtggcgcaa 
tgcagctggg 
acgtgggcga 
tcaccatccc 



cctgatgcag 
caaggagggc 
cggcaaggag 
ggacctggcc 
cagccccacc 
cgagcgccag 
catcaaggtg 
gctggaggag 
cttcatcaag 
cggcaccgtg 
gctgggctgc 
gcccggcatg 
cctgaccgcc 
gaacccctac 
gctggtggac 
catcccccac 
cgcctacttc 
cagcatcaac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 



24 



aacgagaccc ccggcatccg ctaccagtac aacgtgctgc cccagggctg gaagggcagc 1140 
cccagcatct tccagagcag catgaccaag atcctggagc ccttccgcgc ccgcaacccc 1200 
gagatcgtga tctaccagta catggacgac ctgtacgtgg gcagcgacct ggagatcggc 1260 
cagcaccgcg ccaagatcga ggagctgcgc aagcacctgc tgcgctgggg cttcaccacc 1320 
cccgacaaga agcaccagaa ggagcccccc ttcctgtgga tgggctacga gctgcacccc 13 80 
gacaagtgga ccgtgcagcc catcgagctg cccgagaagg agagctggac cgtgaacgac 144 0 
atccagaagc tggtgggcaa gctgaactgg gccagccaga tctaccccgg catcaaggtg 1500 
cgccagctgt gcaagctgct gcgcggcgcc aaggccctga ccgacatcgt gcccctgacc 1560 
gaggaggccg agctggagct ggccgagaac cgcgagatcc tgcgcgagcc cgtgcacggc 162 0 
gtgtactacg accccagcaa ggacctggtg gccgagatcc agaagcaggg ccacgaccag 1680 
tggacctacc agatctacca ggagcccttc aagaacctga agaccggcaa gtacgccaag 174 0 
atgcgcaccg cccacaccaa cgacgtgaag cagctgaccg aggccgtgca gaagatcgcc 1800 
atggagagca tcgtgatctg gggcaagacc cccaagttcc gcctgcccat ccagaaggag 1860 
acctgggaga cctggtggac cgactactgg caggccacct ggatccccga gtgggagttc 192 0 
gtgaacaccc cccccctggt gaagctgtgg taccagctgg agaaggagcc catcatcggc 1980 
gccgagacct tctacgtgga cggcgccgcc aaccgcgaga ccaagatcgg caaggccggc 2 04 0 
tacgtgaccg accggggccg gcagaagatc gtgagcctga ccgagaccac caaccagaag 210 0 
accgagctgc aggccatcca gctggccctg caggacagcg gcagcgaggt gaacatcgtg 2160 
accgacagcc agtacgccct gggcatcatc caggcccagc ccgacaagag cgagagcgag 2220 
ctggtgaacc agatcatcga gcagctgatc aagaaggaga aggtgtacct gagctgggtg 22 8 0 
cccgcccaca agggcatcgg cggcaacgag cagatcgaca agctggtgag caagggcatc 2340 
cgcaaggtgc tgttcctgga cggcatcgat ggcggcatcg tgatctacca gtacatggac 2400 
gacctgtacg tgggcagcgg cggccctagg atcgattaaa agcttcccgg ggctagcacc 2460 
ggtgaattc 2469 

<210> 31 
<211> 2463 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PR975YM 
<400> 31 

gtcgacgcca ccatggccga 
cgcagcaact tcaagggccc 
cacatcgccc gcaactgccg 
ggccaccaga tgaaggactg 
ttcccccagg gcaaggcccg 
agccgcgagc tgcaggtgcg 
ggcaccctga acttccccca 
ggcggccaga tcaaggaggc 
atgagcctgc ccggcaagtg 
gtgcgccagt acgaccagat 
ctgatcggcc ccacccccgt 
accctgaact tccccatcag 
gacggcccca aggtgaagca 
atctgcgagg agatggagaa 
aacacccccg tgttcgccat 
ttccgcgagc tgaacaagcg 
cccgccggcc tgaagaagaa 
agcgtgcccc tggacgagga 
aacgagaccc ccggcatccg 
cccagcatct tccagagcag 



ggccatgagc caggccacca gcgccaacat cctgatgcag 60 
caagcgcatc atcaagtgct tcaactgcgg caaggagggc 12 0 
cgccccccgc aagaagggct gctggaagtg cggcaaggag 18 0 
caccgagcgc caggccaact tcttccgcga ggacctggcc 240 
cgagttcccc agcgagcaga accgcgccaa cagccccacc 3 00 
cggcgacaac ccccgcagcg aggccggcgc cgagcgccag 3 60 
gatcaccctg tggcagcgcc ccctggtgag catcaaggtg 42 0 
cctgctggac accggcgccg acgacaccgt gctggaggag 4 80 
gaagcccaag atgatcggcg gcatcggcgg cttcatcaag 540 
cctgatcgag atctgcggca agaaggccat cggcaccgtg 600 
gaacatcatc ggccgcaaca tgctgaccca gctgggctgc 660 
ccccatcgag accgtgcccg tgaagctgaa gcccggcatg 720 
gtggcccctg accgaggaga agatcaaggc cctgaccgcc 780 
ggagggcaag atcaccaaga tcggccccga gaacccctac 840 
caagaagaag gacagcacca agtggcgcaa gctggtggac 900 
cacccaggac ttctgggagg tgcagctggg catcccccac 960 
gaagagcgtg accgtgctgg acgtgggcga cgcctacttc 1020 
cttccgcaag tacaccgcct tcaccatccc cagcatcaac 1080 
ctaccagtac aacgtgctgc cccagggctg gaagggcagc 114 0 
catgaccaag atcctggagc ccttccgcgc ccgcaacccc 1200 
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gagatcgtga tctaccaggc ccccctgtac 
cgcgccaaga tcgaggagct gcgcaagcac 
aagaagcacc agaaggagcc ccccttcctg 
tggaccgtgc agcccatcga gctgcccgag 
aagctggtgg gcaagctgaa ctgggccagc 
ctgtgcaagc tgctgcgcgg cgccaaggcc 
gccgagctgg agctggccga gaaccgcgag 
tacgacccca gcaaggacct ggtggccgag 
taccagatct accaggagcc cttcaagaac 
accgcccaca ccaacgacgt gaagcagctg 
agcatcgtga tctggggcaa gacccccaag 
gagacctggt ggaccgacta ctggcaggcc 
accccccccc tggtgaagct gtggtaccag 
accttctacg tggacggcgc cgccaaccgc 
accgaccggg gccggcagaa gatcgtgagc 
ctgcaggcca tccagctggc cctgcaggac 
agccagtacg ccctgggcat catccaggcc 
aaccagatca tcgagcagct gatcaagaag 
cacaagggca tcggcggcaa cgagcagatc 
gtgctgttcc tggacggcat cgatggcggc 
tacgtgggca gcggcggccc taggatcgat 
ttc 



9tgggcagcg acctggagat cggccagcac 12 60 
ctgctgcgct ggggcttcac cacccccgac 132 0 
tggatgggct acgagctgca ccccgacaag 13 80 
aaggagagct ggaccgtgaa cgacatccag 144 0 
cagatctacc ccggcatcaa ggtgcgccag 1500 
ctgaccgaca tcgtgcccct gaccgaggag 1560 
atcctgcgcg agcccgtgca cggcgtgtac 162 0 
atccagaagc agggccacga ccagtggacc 1680 
ctgaagaccg gcaagtacgc caagatgcgc 174 0 
accgaggccg tgcagaagat cgccatggag 1800 
ttccgcctgc ccatccagaa ggagacctgg 1860 
acctggatcc ccgagtggga gttcgtgaac 1920 
ctggagaagg agcccatcat cggcgccgag 1980 
gagaccaaga tcggcaaggc cggctacgtg 2 04 0 
ctgaccgaga ccaccaacca gaagaccgag 2100 
agcggcagcg aggtgaacat cgtgaccgac 2160 
cagcccgaca agagcgagag cgagctggtg 2220 
gagaaggtgt acctgagctg ggtgcccgcc 22 80 
gacaagctgg tgagcaaggg catccgcaag 2 34 0 
atcgtgatct accagtacat ggacgacctg 2400 
taaaagcttc ccggggctag caccggtgaa 24 60 

2463 



<210> 32 
<211> 2457 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial S 
<400> 32 

gtcgacgcca ccatggccga ggccatgagc 
cgcagcaact tcaagggccc caagcgcatc 
cacatcgccc gcaactgccg cgccccccgc 
ggccaccaga tgaaggactg caccgagcgc 
ttcccccagg gcaaggcccg cgagttcccc 
agccgcgagc tgcaggtgcg cggcgacaac 
ggcaccctga acttccccca gatcaccctg 
ggcggccaga tcaaggaggc cctgctggac 
atgagcctgc ccggcaagtg gaagcccaag 
gtgcgccagt acgaccagat cctgatcgag 
ctgatcggcc ccacccccgt gaacatcatc 
accctgaact tccccatcag ccccatcgag 
gacggcccca aggtgaagca gtggcccctg 
atctgcgagg agatggagaa ggagggcaag 
aacacccccg tgttcgccat caagaagaag 
ttccgcgagc tgaacaagcg cacccaggac 
cccgccggcc tgaagaagaa gaagagcgtg 
agcgtgcccc tggacgagga cttccgcaag 
aacgagaccc ccggcatccg ctaccagtac 
cccagcatct tccagagcag catgaccaag 
gagatcgtga tctaccaggc ccccctgtac 
cgcgccaaga tcgaggagct gcgcaagcac 



:nce: PR975YMWM 



caggccacca gcgccaacat cctgatgcag 60 
atcaagtgct tcaactgcgg caaggagggc 12 0 
aagaagggct gctggaagtg cggcaaggag 180 
caggccaact tcttccgcga ggacctggcc 24 0 
agcgagcaga accgcgccaa cagccccacc 3 00 
ccccgcagcg aggccggcgc cgagcgccag 3 60 
tggcagcgcc ccctggtgag catcaaggtg 42 0 
accggcgccg acgacaccgt gctggaggag 480 
atgatcggcg gcatcggcgg cttcatcaag 540 
atctgcggca agaaggccat cggcaccgtg 600 
ggccgcaaca tgctgaccca gctgggctgc 660 
accgtgcccg tgaagctgaa gcccggcatg 720 
accgaggaga agatcaaggc cctgaccgcc 780 
atcaccaaga tcggccccga gaacccctac 840 
gacagcacca agtggcgcaa gctggtggac 900 
ttctgggagg tgcagctggg catcccccac 960 
accgtgctgg acgtgggcga cgcctacttc 102 0 
tacaccgcct tcaccatccc cagcatcaac 1080 
aacgtgctgc cccagggctg gaagggcagc 114 0 
atcctggagc ccttccgcgc ccgcaacccc 1200 
gtgggcagcg acctggagat cggccagcac 12 6 0 
ctgctgcgct ggggcttcac cacccccgac 1320 
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aagaagcacc agaaggagcc ccccttcctg cccatcgagc tgcaccccga caagtggacc 13 8 0 
gtgcagccca tcgagctgcc cgagaaggag agctggaccg tgaacgacat ccagaagctg 144 0 
gtgggcaagc tgaactgggc cagccagatc taccccggca tcaaggtgcg ccagctgtgc 1500 
aagctgctgc gcggcgccaa ggccctgacc gacatcgtgc ccctgaccga ggaggccgag 1560 
ctggagctgg ccgagaaccg cgagatcctg cgcgagcccg tgcacggcgt gtactacgac 162 0 
cccagcaagg acctggtggc cgagatccag aagcagggcc acgaccagtg gacctaccag 168 0 
atctaccagg agcccttcaa gaacctgaag accggcaagt acgccaagat gcgcaccgcc 174 0 
cacaccaacg acgtgaagca gctgaccgag gccgtgcaga agatcgccat ggagagcatc 1800 
gtgatctggg gcaagacccc caagttccgc ctgcccatcc agaaggagac ctgggagacc 1860 
tggtggaccg actactggca ggccacctgg atccccgagt gggagttcgt gaacaccccc 192 0 
cccctggtga agctgtggta ccagctggag aaggagccca tcatcggcgc cgagaccttc 1980 
tacgtggacg gcgccgccaa ccgcgagacc aagatcggca aggccggcta cgtgaccgac 2 04 0 
cggggccggc agaagatcgt gagcctgacc gagaccacca accagaagac cgagctgcag 2100 
gccatccagc tggccctgca ggacagcggc agcgaggtga acatcgtgac cgacagccag 2160 
tacgccctgg gcatcatcca ggcccagccc gacaagagcg agagcgagct ggtgaaccag 222 0 
atcatcgagc agctgatcaa gaaggagaag gtgtacctga gctgggtgcc cgcccacaag 2280 
ggcatcggcg gcaacgagca gatcgacaag ctggtgagca agggcatccg caaggtgctg 2340 
ttcctggacg gcatcgatgg cggcatcgtg atctaccagt acatggacga cctgtacgtg 2400 
ggcagcggcg gccctaggat cgattaaaag cttcccgggg ctagcaccgg tgaattc 2457 

<210> 33 
<211> 9781 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 33 

tggaagggtt aatttactcc aagaaaaggc aagaaatcct tgatttgtgg gtctatcaca 60 
cacaaggctt cttccctgat tggcaaaact acacaccggg gccaggggtc agatatccac 12 0 
tgacctttgg atggtgctac aagctagtgc cagttgaccc aggggaggtg gaagaggcca 180 
acggaggaga agacaactgt ttgctacacc ctatgagcca acatggagca gaggatgaag 24 0 
atagagaagt attaaagtgg aagtttgaca gcctcctagc acgcagacac atggcccgcg 3 00 
agctacatcc ggagtattac aaagactgct gacacagaag ggactttccg cctgggactt 360 
tccactgggg cgttccggga ggtgtggtct gggcgggact tgggagtggt caaccctcag 42 0 
atgctgcata taagcagctg cttttcgcct gtactgggtc tctctcggta gaccagatct 480 
gagcctggga gccctctggc tatctaggga acccactgct taagcctcaa taaagcttgc 540 
cttgagtgct ttaagtagtg tgtgcccatc tgttgtgtga ctctggtaac tagagatccc 600 
tcagaccctt tgtggtagtg tggaaaatct ctagcagtgg cgcccgaaca gggaccagaa 660 
agtgaaagtg agaccagagg agatctctcg acgcaggact cggcttgctg aagtgcacac 720 
ggcaagaggc gagaggggcg gctggtgagt acgccaattt tacttgacta gcggaggcta 780 
gaaggagaga gatgggtgcg agagcgtcaa tattaagcgg cggaaaatta gataaatggg 84 0 
aaagaattag gttaaggcca gggggaaaga aacattatat gttaaaacat ctagtatggg 900 
caagcaggga gctggaaaga tttgcactta accctggcct gttagaaaca tcagaaggct 960 
gtaaacaaat aataaaacag ctacaaccag ctcttcagac aggaacagag gaacttagat 1020 
cattattcaa cacagtagca actctctatt gtgtacataa agggatagag gtacgagaca 1080 
ccaaggaagc cttagacaag atagaggaag aacaaaacaa atgtcagcaa aaagcacaac 114 0 
aggcaaaagc agctgacgaa aaggtcagtc aaaattatcc tatagtacag aatgcccaag 1200 
ggcaaatggt acaccaagct atatcaccta gaacattgaa tgcatggata aaagtaatag 1260 
aggaaaaggc tttcaatcca gaggaaatac ccatgtttac agcattatca gaaggagcca 132 0 
ccccacaaga tttaaacaca atgttaaata cagtgggggg acatcaagca gccatgcaaa 1380 
tgttaaaaga taccatcaat gaggaggctg cagaatggga taggacacat ccagtacatg 144 0 
cagggcctgt tgcaccaggc cagatgagag aaccaagggg aagtgacata gcaggaacta 1500 
ctagtaccct .tcaggaacaa atagcatgga tgacaagtaa tccacctatt ccagtagaag 1560 
acatctataa aagatggata attctggggt taaataaaat agtaagaatg tatagccctg 1620 
ttagcatttt ggacataaaa caagggccaa aagaaccctt tagagactat gtagaccggt 1680 
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tctttaaaac cttaagagct gaacaagcta 
ccttgttggt ccaaaatgcg aacccagatt 
gggcctcatt agaagaaatg atgacagcat 
caagagtgtt ggctgaggca atgagccaag 
attttaaagg ctctaacaga attattaaat 
ccagaaattg cagggcccct aggaaaaagg 
aaatgaaaga ctgtactgag aggcaggcta 
aggggaggcc agggaatttc ctccagaaca 
caacagcccc accagcagag agcttcaggt 
agaaagagag ggaaccttta acttccctca 
aataaaagta gagggccaga taaaggaggc 
attagaagaa atagatttgc cagggaaatg 
ttttatcaaa gtaagacagt atgatcaaat 
aggtacagta ttagtagggc ctacaccagt 
gcttggatgc acactaaatt ttccaattag 
accaggaatg gatggcccaa aggtcaaaca 
attaacagca atttgtgagg aaatggagaa 
taatccatat aacactccag tatttgccat 
attagtagat ttcagggaac tcaataaaag 
aataccacac ccagcaggat taaaaaagaa 
tgcatatttt tcagttcctt tagatgaaag 
tagtataaac aatgaaacac cagggattag 
gaaaggatca ccagcaatat tccagagtag 
aaaaaatcca gacatagtta tctatcaata 
agaaataggg caacatagag caaaaataga 
atttacaaca ccagacaaga aacatcaaaa 
actccatcct gacaaatgga cagtacaacc 
tgtcaatgat atacagaagt tagtgggaaa 
gattaaagta aggcaactct gtaaactcct 
accactaact gaagaagcag aattagaatt 
agtacatgga gtatattatg atccatcaaa 
gcatgaacaa tggacatatc aaatttatca 
gtatgcaaaa atgaggacta cccacactaa 
aaaaatagcc atggaaagca tagtaatatg 
ccaaaaagaa acatgggaga catggtggac 
gtgggagttt gttaataccc ctcccctagt 
catagcagga gtagaaactt tctatgtaga 
aaaagcaggg tatgttactg acagaggaag 
aaatcagaag actgagttac aagcaattca 
aaacatagta acagactcac agtatgcatt 
tgactcagag atatttaacc aaataataga 
gtcatgggta ccagcacata aaggaattgg 
taagggaatt aggaaagtgt tgtttctaga 
aaggtaccac agcaattgga gagcaatggc 
aaaagaaata gtagctagct gtgataaatg 
agtcgactgt agtccaggga tatggcaatt 
cctggtagca gtccatgtag ctagtggcta 
aggacaagaa acagcatatt ttatattaaa 
acatacagac aatggcagta attttaccag 
aggtatccaa caggaatttg gaattcccta 
catgaataaa gaattaaaga aaataatagg 
gacagcagta caaatggcag tattcattca 
gtacagtgca ggggaaagaa taatagacat 
acaaaaacaa attataagaa ttcaaaattt 



cacaagatgt aaagaattgg atgacagaca 174 0 
gtaagaccat tttaagagca ttaggaccag 1800 
gtcagggagt gggaggacct agccataaag 1860 
caaacagtaa catactagtg cagagaagca 1920 
gtttcaactg tggcaaagta gggcacatag 1980 
gctgttggaa atgtggacag gaaggacacc 2040 
attttttagg gaaaatttgg ccttcccaca 2100 
gaccagagcc aacagcccca ccagcagaac 2160 
tcgaggagac aacccccgtg ccgaggaagg 2220 
aatcactctt tggcagcgac cccttgtctc 2280 
tctcttagac acaggagcag atgatacagt 2340 
gaaaccaaaa atgatagggg gaattggagg 24 00 
acttatagaa atttgtggaa aaaaggctat 24 60 
caacataatt ggaagaaatc tgttaactca 2520 
tcctattgaa actgtaccag taaaattaaa 2580 
atggccattg acagaagaaa aaataaaagc 264 0 
ggaaggaaaa attacaaaaa ttgggcctga 2700 
aaaaaagaag gacagtacta agtggagaaa 2 760 
aactcaagac ttttgggaag ttcaattagg 2820 
aaaatcagtg acagtgctag atgtggggga 2 880 
cttcaggaaa tatactgcat tcaccatacc 2940 
atatcaatat aatgtgctgc cacagggatg 3000 
catgacaaaa atcttagagc ccttcagagc 3 060 
tatggatgac ttgtatgtag gatctgactt 312 0 
agagttaagg gaacatttat tgaaatgggg 318 0 
agaaccccca tttctttgga tggggtatga 3240 
tatactgctg ccagaaaagg atagttggac 33 00 
attaaactgg gcaagtcaga tttacccagg 3360 
caggggggcc aaagcactaa cagacatagt 342 0 
ggcagagaac agggaaattt taagagaacc 34 80 
agacttgata gctgaaatac agaaacaggg 354 0 
agaaccattt aaaaatctga aaacagggaa 3600 
tgatgtaaaa cagttaacag aggcagtgca 3660 
gggaaagact cctaaattta gactacccat 3720 
agactattgg caagccacct ggatccctga 3780 
aaaattatgg taccaactag aaaaagatcc 3 84 0 
tggagcaact aatagggaag ctaaaatagg 3 900 
gcagaaaatt gttactctaa ctaacacaac 3960 
gctagctctg caggattcag gatcagaagt 402 0 
aggaatcatt caagcacaac cagataagag 40 8 0 
acagttaata aacaaggaaa gaatctacct 414 0 
gggaaatgaa caagtagata aattagtaag 42 00 
tggaatagat aaagctcaag aagagcatga 4260 
taatgagttt aatctgccac ccatagtagc 432 0 
tcagctaaaa ggggaagcca tacatggaca 43 8 0 
agattgtacc catttagagg gaaaaatcat 444 0 
catggaagca gaggttatcc cagcagaaac 4500 
attagcagga agatggccag tcaaagtaat 4560 
tactgcagtt aaggcagcct gttggtgggc 462 0 
caatccccaa agtcagggag tggtagaatc 4680 
acaagtaaga gatcaagctg agcaccttaa 474 0 
caattttaaa agaaaagggg gaattggggg 4 8 00 
aatagcaaca gacatacaaa ctaaagaatt 4860 
tcgggtttat tacagagaca gcagagaccc 4 92 0 
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tatttggaaa ggaccagccg aactactctg gaaaggtgaa ggggtagtag taatagaaga 4980 
taaaggtgac ataaaggtag taccaaggag gaaagcaaaa atcattagag attatggaaa 5040 
acagatggca ggtgctgatt gtgtggcagg tggacaggat gaagattaga gcatggaata 5100 
gtttagtaaa gcaccatatg tatatatcaa ggagagctag tggatgggtc tacagacatc 5160 
attttgaaag cagacatcca aaagtaagtt cagaagtaca tatcccatta ggggatgcta 522 0 
gattagtaat aaaaacatat tggggtttgc agacaggaga aagagattgg catttgggtc 52 80 
atggagtctc catagaatgg agactgagag aatacagcac acaagtagac cctgacctgg 5340 
cagaccagct aattcacatg cattattttg attgttttac agaatctgcc ataagacaag 54 00 
ccatattagg acacatagtt tttcctaggt gtgactatca agcaggacat aagaaggtag 54 60 
gatctctgca atacttggca ctgacagcat tgataaaacc aaaaaagaga aagccacctc 552 0 
tgcctagtgt tagaaaatta gtagaggata gatggaacga cccccagaag accaggggcc 5580 
gcagagggaa ccatacaatg aatggacact agagattcta gaagaactca agcaggaagc 5640 
tgtcagacac tttcctagac catggctcca tagcttagga caatatatct atgaaaccta 5700 
tggggatact tggacgggag ttgaagctat aataagagta ctgcaacaac tactgttcat 5760 
tcatttcaga attggatgcc aacatagcag aataggcatc ttgcgacaga gaagagcaag 582 0 
aaatggagcc agtagatcct aaactaaagc cctggaacca tccaggaagc caacctaaaa 5 880 
cagcttgtaa taattgcttt tgcaaacact gtagctatca ttgtctagtt tgctttcaga 5940 
caaaaggttt aggcatttcc tatggcagga agaagcggag acagcgacga agcgctcctc 6000 
caagtggtga agatcatcaa aatcctctat caaagcagta agtacacata gtagatgtaa 6060 
tggtaagttt aagtttattt aaaggagtag attatagatt aggagtagga gcattgatag 6120 
tagcactaat catagcaata atagtgtgga ccatagcata tatagaatat aggaaattgg 6180 
taagacaaaa gaaaatagac tggttaatta aaagaattag ggaaagagca gaagacagtg 6240 
gcaatgagag tgatggggac acagaagaat tgtcaacaat ggtggatatg gggcatctta 63 00 
ggcttctgga tgctaatgat ttgtaacacg gaggacttgt gggtcacagt ctactatggg 6360 
gtacctgtgt ggagagaagc aaaaactact ctattctcjtg catcagatgc taaagcatat 6420 
gagacagaag tgcataatgt ctgggctaca catgcttgtg tacccacaga ccccaaccca 6480 
caagaaatag ttttgggaaa tgtaacagaa aattttaata tgtggaaaaa taacatggca 6540 
gatcagatgc atgaggatat aatcagttta tgggatcaaa gcctaaagcc atgtgtaaag 6600 
ttgaccccac tctgtgtcac tttaaactgt acagatacaa atgttacagg taatagaact 6660 
gttacaggta atacaaatga taccaatatt gcaaatgcta catataagta tgaagaaatg 6720 
aaaaattgct ctttcaatgc aaccacagaa ttaagagata agaaacataa agagtatgca 6780 
ctcttttata aacttgatat agtaccactt aatgaaaata gtaacaactt tacatataga 684 0 
ttaataaatt gcaatacctc aaccataaca caagcctgtc caaaggtctc ttttgacccg 6900 
attcctatac attactgtgc tccagctgat tatgcgattc taaagtgtaa taataagaca 6960 
ttcaatggga caggaccatg ttataatgtc agcacagtac aatgtacaca tggaattaag 702 0 
ccagtggtat caactcaact actgttaaat ggtagtctag cagaagaagg gataataatt 7080 
agatctgaaa atttgacaga gaataccaaa acaataatag tacatcttaa tgaatctgta 7140 
gagattaatt gtacaaggcc caacaataat acaaggaaaa gtgtaaggat aggaccagga 7200 
caagcattct atgcaacaaa tgacgtaata ggaaacataa gacaagcaca ttgtaacatt 7260 
agtacagata gatggaataa aactttacaa caggtaatga aaaaattagg agagcatttc 732 0 
cctaataaaa caataaaatt tgaaccacat gcaggagggg atctagaaat tacaatgcat 73 8 0 
agctttaatt gtagaggaga atttttctat tgcaatacat caaacctgtt taatagtaca 744 0 
tactacccta agaatggtac atacaaatac aatggtaatt caagcttacc catcacactc 750 0 
caatgcaaaa taaaacaaat tgtacgcatg tggcaagggg taggacaagc aatgtatgcc 7560 
cctcccattg caggaaacat aacatgtaga tcaaacatca caggaatact attgacacgt 7620 
gatgggggat ttaacaacac aaacaacgac acagaggaga cattcagacc tggaggagga 7680 
gatatgaggg ataactggag aagtgaatta tataaatata aagtggtaga aattaagcca 7740 
ttgggaatag cacccactaa ggcaaaaaga agagtggtgc agagaaaaaa aagagcagtg 7800 
ggaataggag ctgtgttcct tgggttcttg ggagcagcag gaagcactat gggcgcagcg 7860 
tcaataacgc tgacggtaca ggccagacaa ctgttgtctg gtatagtgca acagcaaagc 792 0 
aatttgctga aggctataga ggcgcaacag catatgttgc aactcacagt ctggggcatt 7980 
aagcagctcc aggcgagagt cctggctata gaaagatacc taaaggatca acagctccta 8040 
gggatttggg gctgctctgg aagactcatc tgcaccactg ctgtgccttg gaactccagt 8100 
tggagtaata aatctgaagc agatatttgg gataacatga cttggatgca gtgggataga 8160 
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gaaattaata attacacaga aacaatattc aggttgcttg aagactcgca aaaccagcag 8220 
gaaaagaatg aaaaagattt attagaattg gacaagtgga ataatctgtg gaattggttt 82 8 0 
gacatatcaa actggctgtg gtatataaaa atattcataa tgatagtagg aggcttgata 8340 
ggtttaagaa taatttttgc tgtgctctct atagtgaata gagttaggca gggatactca 8400 
cctttgtcat ttcagaccct taccccaagc ccgaggggac tcgacaggct cggaggaatc 8460 
gaagaagaag gtggagagca agacagagac agatccatac gattggtgag cggattcttg 852 0 
tcgcttgcct gggacgatct gcggagcctg tgcctcttca gctaccaccg cttgagagac 8580 
ttcatattaa ttgcagtgag ggcagtggaa cttctgggac acagcagtct caggggacta 8640 
cagagggggt gggagatcct taagtatctg ggaagtcttg tgcagtattg gggtctagag 870 0 
ctaaaaaaga gtgctattag tccgcttgat accatagcaa tagcagtagc tgaaggaaca 8760 
gataggatta tagaattggt acaaagaatt tgtagagcta tcctcaacat acctaggaga 8 82 0 
ataagacagg gctttgaagc agctttgcta taaaatggga ggcaagtggt caaaacgcag 8 8 80 
catagttgga tggcctgcag taagagaaag aatgagaaga actgagccag cagcagaggg 894 0 
agtaggagca gcgtctcaag acttagatag acatggggca cttacaagca gcaacacacc 9000 
tgctactaat gaagcttgtg cctggctgca agcacaagag gaggacggag atgtaggctt 9060 
tccagtcaga cctcaggtac ctttaagacc aatgacttat aagagtgcag tagatctcag 912 0 
cttcttttta aaagaaaagg ggggactgga agggttaatt tactctagga aaaggcaaga 9180 
aatccttgat ttgtgggtct ataacacaca aggcttcttc cctgattggc aaaactacac 9240 
atcggggcca ggggtccgat tcccactgac ctttggatgg tgcttcaagc tagtaccagt 93 0 0 
^ tgacccaagg gaggtgaaag aggccaatga aggagaagac aactgtttgc tacaccctat 9360 
^ gagccaacat ggagcagagg atgaagatag agaagtatta aagtggaagt ttgacagcct 942 0 
S tctagcacac agacacatgg cccgcgagct acatccggag tattacaaag actgctgaca 9480 
Jl cagaagggac tttccgcctg ggactttcca ctggggcgtt ccgggaggtg tggtctgggc 954 0 
^ gggacttggg agtggtcacc ctcagatgct gcatataagc agctgctttt cgcttgtact 9600 

□ gggtctctct cggtagacca gatctgagcc tgggagctct ctggctatct agggaaccca 9660 
d ctgcttaggc ctcaataaag cttgccttga gtgctctaag tagtgtgtgc ccatctgttg 972 0 
^ tgtgactctg gtaactagag atccctcaga ccctttgtgg tagtgtggaa aatctctagc 9780 
U ^ 9781 

L,^ <210> 34 

<211> 203 
y <212> DNA 

=J <213> Human immunodeficiency virus 
3 <400> 34 

□ gctgaggcaa tgagccaagc aaccagcgca aacatactga tgcagagaag caatttcaaa 60 
ggccctaaaa gaattattaa atgtttcaac tgtggcaagg aagggcacat agctagaaat 120 
tgtagggccc ctaggaaaaa aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa 180 
gactgtactg agaggcaggc taa 2 03 

<210> 35 
<211> 2151 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 35 

ttttttaggg aagatttggc cttcccacaa gggaaggcca gggaatttcc ttcagaacag 60 
aacagagcca acagccccac cagcagagag cttcaagttc gaggagacaa cccccgctcc 120 
gaagcaggag ccgaaagaca gggaaccctt aatttccctc aaatcactct ttggcagcga 180 
ccccttgtct caataaaagt agggggtcaa ataaaggagg ctctcttaga cacaggagct 240 
gatgatacag tattagaaga aatgagtttg ccaggaaaat ggaaaccaaa aatgatagga 300 
ggaattggag gttttatcaa agtaagacag tatgatcaaa tacttataga aatttgtgga 3 60 
aaaaaggcta taggtacagt attaatagga cctacacctg tcaacataat tggaaggaat 420 
atgttgactc agcttggatg cacactaaat tttccaatta gtcccattga aactgtgcca 480 
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gtaaaattaa agccaggaat ggatggccca aaggttaaac aatggccatt gacagaagag 540 
aaaataaaag cattaacagc aatttgtgaa gaaatggaga aagaaggaaa aattacaaaa 600 
attgggcctg aaaatccata taacactcca gtatttgcca taaaaaagaa ggacagtact 660 
aagtggagaa agttagtaga tttcagggaa cttaataaaa gaactcaaga cttttgggaa 720 
gttcaattag gaataccaca cccagcaggg ttaaaaaaga aaaaatcagt gacagtactg 780 
gatgtggggg atgcatattt ttcagttcct ttagatgagg acttcaggaa atatactgca 84 0 
ttcaccatac ctagtataaa caatgaaaca ccagggatta gatatcaata taatgtgctt 900 
ccacagggat ggaaaggatc accatcaata ttccagagta gcatgacaaa aatcttagag 960 
ccctttagag caagaaatcc agaaatagtc atctatcaat atatggatga cttgtatgta 1020 
ggatctgact tagaaatagg gcaacataga gcaaaaatag aggagttaag aaaacatctg 1080 
ttaaggtggg gatttaccac accggacaag aaacatcaga aagaaccccc atttctttgg 1140 
atggggtatg aactccatcc tgacaaatgg acagtacagc ctatagagtt gccagaaaag 1200 
gaaagctgga ctgtcaatga tatacagaag ttagtgggaa aattaaattg ggccagtcag 1260 
atttacccag gaattaaagt aaggcaactt tgtaaactcc ttaggggggc caaagcacta 1320 
acagatatag taccactaac tgaagaagca gaattagaat tggcagagaa cagggaaatt 13 80 
ctaagagaac cagtacatgg agtatattat gacccatcaa aagacttggt agctgaaata 1440 
cagaaacagg ggcatgacca atggacatat caaatttacc aagaaccatt caaaaacctg 1500 
aaaacaggga agtatgcaaa aatgaggact gcccacacta atgatgtaaa acagttaaca 1560 
gaggcagtgc aaaaaatagc tatggaaagc atagtaatat ggggaaagac tcctaaattt 1620 
agactaccca tccaaaaaga aacatgggag acatggtgga cagactattg gcaagccacc 1680 
tggattcctg agtgggagtt tgttaatacc cctcccttag taaaattatg gtaccagcta 1740 
gagaaagaac ccataatagg agcagaaact ttctatgtag atggagcagc taatagggaa 1800 
actaaaatag gaaaagcagg gtatgttact gacagaggaa ggcagaaaat tgtttctcta 1860 
acagaaacaa caaatcagaa gactgaatta caagcaattc agctagcttt gcaagattca 192 0 
ggatcagaag taaacatagt aacagactca cagtatgcat taggaatcat tcaagcacaa 1980 
ccagataaga gtgaatcaga gttagtcaac caaataatag aacaattaat aaaaaaggaa 2 04 0 
aaggtctacc tgtcatgggt accagcacat aaaggaattg gaggaaatga acaaatagat 2100 
aaattagtaa gtaagggaat caggaaagtg ctgtttctag atggaataga t 2151 

<210> 36 
<211> 54 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 36 

ggcggcatcg tgatctacca gtacatggac gacctgtacg tgggcagcgg cggc 54 

<210> 37 
<211> 18 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 37 

Gly Gly He Val He Tyr Gin Tyr Met Asp Asp Leu Tyr Val Gly Ser 
1 5 .10 15 

Gly Gly 



<210> 38 

<211> 38 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: primer 
SlFCSacTA 

<400> 38 

gtttcttgag ctctggaagg gttaatttac tccaagaa 

<210> 39 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
SlFTSacTA 

<400> 39 

gtttcttgag ctctggaagg gttaatttac tctaagaa 

<210> 40 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S145RTSalTA 

<400> 40 

gtttcttgtc gacttgtcca tgtatggctt cccct 

<210> 41 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S145RCSalTA 

<400> 41 

gtttcttgtc gacttgtcca tgcatggctt ccct 

<210> 42 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S245FASalTA 

<400> 42 

gtttcttgtc gactgtagtc caggaatatg gcaattag 
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<210> 43 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S245FGSalTA 

<400> 43 

gtttcttgtc gactgtagtc cagggatatg gcaattag 

<210> 44 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S2FullNotTA 

<400> 44 

gtttcttgcg gccgctgcta gagattttcc acactacca 




<210> 45 
<211> 9738 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 45 

tggaagggtt aatttactcc aggaaaaggc aagagatcct tgatttatgg gtctatcaca 60 
cacaaggcta cttccctgat tggcaaaact acacaccggg accaggggtc agatatccac 12 0 
tgacctttgg atggtgcttc aagctagtgc cagttgaccc aagggaagta gaagaggcca 180 
acggaggaga agacaactgt ttgctacacc ctatgagcca gtatggaatg gatgatgaac 240 
acaaagaagt gttacagtgg aagtttgaca gcagcctagc acgcagacac ctggcccgcg 3 00 
agctacatcc ggattattac aaagactgct gacacagaag ggactttccg cctgggactt 3 60 
tccactgggg cgttccaggg ggagtggtct gggcgggact gggagtggcc agccctcaga 420 
tgctgcatat aagcagcggc ttttcgcctg tactgggtct ctctaggtag accagatccg 480 
agcctgggag ctctctgtct atctggggaa cccactgctt aggcctcaat aaagcttgcc 540 
ttgagtgctc taagtagtgt gtgcccatct gttgtgtgac tctggtaact ctggtaacta 600 
gagatccctc agaccctttg tggtagtgtg gaaaatctct agcagtggcg cccgaacagg 660 
gacttgaaag cgaaagtgag accagagaag atctctcgac gcaggactcg gcttgctgaa 72 0 
gtgcactcgg caagaggcga ggggggcgac tggtgagtac gccaaaattt tttttgacta 780 
gcggaggcta gaaggagaga gatgggtgcg agagcgtcaa tattaagagg gggaaaatta 840 
gacaaatggg aaaaaattag gttacggcca ggggggagaa aacactatat gctaaaacac 900 
ctagtatggg caagcagaga gctggaaaga tttgcagtta accctggcct tttagagaca 960 
tcagacggat gtagacaaat aataaaacag ctacaaccag ctcttcagac aggaacagag 1020 
gaaattagat cattatttaa cacagtagca actctctatt gtgtacataa agggatagat 1080 
gtacgagaca ccaaggaagc cttagacaag atagaggagg aacaaaacaa atgtcagcaa 114 0 
aaaacacagc aggcggaagc ggctgacaaa aaggtcagtc aaaattatcc tatagtgcag 1200 
aacctccaag ggcaaatggt acaccaggcc atatcaccta gaaccttgaa tgcatgggta 1260 
aaagtaatag aggagaaggc ttttagccca gaggtaatac ccatgtttac agcattatca 1320 
gaaggagcca ccccacaaga tttaaacacc atgttaaata cagtgggggg acatcaagca 13 80 
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gccatgcaaa tgttaaaaga taccatcaat 
ccagtacatg cagggcctgt tgcaccaggc 
gcaggaacta ctagtaccct tcaagaacaa 
ccagtagggg acatctataa aaggtggata 
tacagccctg tcagcatttt agacataaaa 
gtagaccggt tcttcaaaac tttaagagct 
atgacagaca ccttgttagt ccaaaatgcg 
ttaggaccag gggcttcatt agaagaaatg 
agccacaaag caagagtttt ggctgaggca 
atacagaaaa gcaattttaa aggccctaga 
gaagggcaca tagccaggaa ttgcagggcc 
aaggaaggac accaaatgaa agactgtact 
tggccttccc acaaggggag gccagggaat 
ccaccactag aaccaacagc cccaccagca 
caggagccga aagacaggga acctttaact 
ttgtctcaat aaaagtagcg ggccaaacaa 
atacagtact agaagaaata aacttgccag 
ttggaggttt tatcaaagta agacagtatg 
gggctatagg tacagtatta gtaggaccta 
tgactcagct tggatgcaca ctaaattttc 
aattaaagcc aggaatggat ggcccaaagg 
taaaagcatt aacagaaatt tgtgaggaaa 
ggcctgaaaa tccatataac actccagtat 
ggagaaaatt agtagatttc agggaactca 
aattaggaat accacaccca gcagggttaa 
tgggagatgc atatttttca gtccctttag 
ccatacctag tataaacaat gaaacaccag 
^gggatggaa aggatcacca gcaatattcc 
ttagaacaca aaacccagaa gtagttatct 
ctgacttaga aatagggcaa catagagcaa 
^^tggggatt taccacacca gacaagaaac 
ggtatgaact ccatcctgac aaatggacag 
gctggactgt caatgatata cagaagttag 
acccagggat taaagtaagg caactgtgta 
acatagtgcc actgactgaa gaagcagaat 
aagaaccagt acatggagta tattatgacc 
aacaggggaa tgaccaatgg acatatcaaa 
caggaaagta tgcaaaaatg aggactgccc 
cagtgcaaaa gataacccag gaaagcatag 
tacccatccc aaaagaaaca tgggagacat 
ttcctgagtg ggagtttgtc aatacccctc 
aagaacccat agtaggggca gaaactttct 
aaataggaaa agcagggtat gtcactgaca 
aaacaacaaa tcagaagact gaattacaag 
cagaagtaaa catagtaaca gactcacagt 
ataagagtga atcagaatta gtcagtcaaa 
tctacctatc atgggtacca gcacataaag 
tagtaagtag tggaatcaga aaagtactgt 
agcatgaaaa atatcacagc aattggagag 
tagtagcaaa ggaaatagta gccagctgtg 
atggacaagt cgactgtagt ccaggaatat 
aaatcatcct agtagcagtc catgtagcca 
cagaaacagg acaagaaaca gcatacttta 
aagtaataca tacagataat ggcagtaatt 



gaggaggctg cagaatggga taggttacat 144 0 
cagatgagag aaccaagggg aagtgacata 1500 
atagcatgga tgacaagtaa cccacctatc 1560 
attctggggt taaataaaat agtaagaatg 1620 
caaggaccaa aggaaccctt tagagactat 1680 
gaacaatcta cacaagaggt aaaaaattgg 174 0 
aacccagatt gtaagaccat tttaagagca 1800 
atgacagcat gtcagggagt gggaggacct 1860 
atgagccaag caaacaatac aagtgtaatg 1920 
agagctgtta aatgtttcaa ctgtggcagg 1980 
cctaggaaaa ggggctgttg gaaatgtgga 2 04 0 
gagaggcagg ctaatttttt agggaaaatt 2100 
ttccttcaga gcagaccaga gccaacagcc 2160 
gagagcttca agttcaagga gactccgaag 2220 
tccctcaaat cactctttgg cagcgacccc 2280 
aggaggctct tttagataca ggagcagatg 2340 
gaaaatggaa accaaaaatg ataggaggaa 24 00 
atcaaatact tatagaaatt tgtggaaaaa 2460 
cacctgtcaa cataattgga agaaatctgt 2520 
caattagccc cattgaaact gtaccagtaa 2580 
ttaaacaatg gccattgaca gaagaaaaaa 2 64 0 
tggagaagga aggaaaaatt acaaaaattg 2 700 
ttgccataaa gaagaaggac agtacaaagt 2760 
ataaaagaac tcaagacttt tgggaagtcc 2 82 0 
aaaagaaaaa atcagtgaca gtactggatg 2880 
atgagagctt cagaaaatat actgcattca 2 94 0 
ggattagata tcaatataat gttcttccac 3000 
agagtagcat gacaagaatc ttagagccct 3 060 
atcaatatat ggatgactta tatgtaggat 312 0 
aaatagagga gttaagagga cacctattga 3180 
atcagaaaga acccccattt ctttggatgg 3240 
tacagcctat acagctgcca gaaaaggaga 33 00 
tgggaaagtt aaactgggca agtcagattt 3 3 60 
aactccttag gggagccaaa gcactaacag 3420 
tagaattggc tgagaacagg gaaattctaa 3480 
catcaaaaga tttaatagct gaaatacaga 3540 
tttaccaaga accatttaaa aatctgagaa 3600 
acactaatga tgtgaaacag ttagcagagg 3660 
taatatgggg aaaaactcct aaatttagac 3 72 0 
ggtggtcaga ctattggcaa gccacctgga 3780 
ccctagtaaa attgtggtac cagctggaaa 3 84 0 
atgtagatgg agcagccaat agggaaacta 3 90 0 
aaggaaggca gaaagttgtt tccttcactg 3 960 
caattcagct agctttgcag gattcagggc 4020 
atgcattagg aatcattcaa gcacaaccag 4080 
taatagaaca gttgataaaa aaggaaaaag 4140 
gaattggagg aaatgaacaa gtagacaaat 4200 
ttctagatgg aatagataaa gctcaagaag 4260 
caatggctag tgagtttaat ctgccaccca 4320 
ataaatgtca gctaaaaggg gaagccatgc 4380 
ggcaattaga ctgtacacat ttagaaggaa 4440 
gtggctacat ggaagcagag gttatcccag 4500 
tactaaaatt agcaggaaga tggccagtca 4560 
tcaccagtac cgcagttaag gcagcctgtt 4620 
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4 

99tgggcaga tatccaacgg gaatttggaa 
tagaatccat gaataaagaa ttaaagaaaa 
accttaagac agcagtacaa atggcagtat 
ttggggggta cagtgcaggg gagagaataa 
aagaattaca aaaacaaatt ataaaaattc 
gagaccctat ttggaaagga ccagccaaac 
tacaagataa tagtgatata aaggtagtac 
atggaaaaca gatggcaggt gctgattgtg 
tggcacagtt tagtaaagca ccatatgtat 
agacatcatt atgaaagcag acacccaaaa 
gatgccaggt tagtaataaa aacatattgg 
ttgggtcacg gagtctccat agaatggaga 
gacctgacag accaactaat tcatatgcat 
aggaaagcca tactaggaca gatagttagc 
aaggtaggat ctctacaata cttggcactg 
ccacctctgc ctagtgttag gaaattagta 
aggggccgca gagggaacca tacaatgaat 
aggaagctgt cagacacttt cctagaccat 
aaacctatgg agatacttgg acaggagttg 
tgtttattca tttcaggatt gggtgccatc 
gagcaagaaa tggagccaat agatcctaac 
cctaaaactg cttgtaatgg gtgttactgt 
tttcagaaaa aaggcttagg catttactat 
gctcctccaa gcaataaaga tcatcaagat 
atatgtaatg ttagatttaa ctgcaagaat 
agtagcacta atcatagcaa taatagtgtg 
ggtaaggcaa aggaaaatag actggttagt 
tggcaatgag agcgaggggg atactgaaga 
taggcttttg gatgctaatg atgtgtaatg 
gggtacctgt ggggagagaa gcaaaaacta 
atgagaaaga agtgcataat gtctgggcta 
cacaagaagt gattttgggc aatgtaacag 
tggatcagat gcaggaagat ataatcagtt 
aattgacccc actctgtgtc actttaaact 
cctctaaaga catgaaaaat tgctctttct 
agaaagaaaa tgcacttttt tatagacttg 
ggaatattaa caactataga ttaataaatt 
caaaagtctc gtttgaccca attcctatac 
taaaatgtaa taataagaaa ttcaatggaa 
aatgtacaca tggaattaag ccagtggtat 
cagaagaaga gataataatt agatctgaaa 
tacatcttaa tgaatctata gagattaaat 
gtgtgagaat aggaccagga caagcattct 
gacaagcaca ttgtaacatt agtaaaaatg 
aaaaattaca agaactcttc cctaatagta 
gggacctaga aattactaca catagcttta 
caacagacct gtttaatagt acatacagta 
ctaataatac agagcgcatc acactccaat 
aggaggtagg acgagcaatg tatgcccctc 
atattacagg actactatta acacgtgatg 
tcagacctgg aggaggagac atgagggaca 
tggtagaaat taaaccatta ggagtagcac 
gagaaaaaag agcagtagga ataggagctg 
gcactatggg cgcagcatca ataacgctga 



• 

ttccctacaa tccccaaagt caaggagtag 4 68 0 
tcatagggca agtaagagat caagctgagc 4 74 0 
tcattcacaa ttttaaaaga aaagggggga 4 80 0 
tagacataat agcatcagac atacaaacta 4860 
aaaattttcg ggtttattac agagacagca 4 92 0 
tactctggaa aggtgaaggg gcagtagtaa 4980 
caagaaggaa agcaaaaatc attaaggact 504 0 
tggcaggtag acaggatgaa gattagaaca 5100 
gtttcgagga gagctgatgg atggttctac 5160 
gtaagttcag aagtacacat cccattagga 5220 
ggtctgcaga caggagaaag agcttggcat 52 8 0 
ttgagaagat atagcacaca agtagaccct 5340 
tattttgatt gttttgcaga atctgccata 5400 
cctaagtgtg actatcaagc aggacataac 5460 
acagcattga taaaaccaaa aaagataaag 552 0 
gaggatagat ggaacaagcc ccagaagacc 5580 
ggacactaga gcttttagaa gaactcaagc 5640 
ggctccataa cttaggacaa catatctatg 5700 
aagcaataat aagaatcctg caacaattac 5760 
atagcagaat aggcattttg cgacagagaa 582 0 
ctagaaccct ggaaccatcc aggaagtcag 5880 
aaacgttgca gctatcattg tctagtttgc 5940 
ggcaggaaga agcggagaca gcgacgaagc 600 0 
cctctaccaa agcagtaagt accgaatagt 6060 
agattctaga ttaggaatag gagcattgat 6120 
gaccatagta tatatagaat ataggaaatt 6180 
taaaaggatt agggaaagag cagaagacag 624 0 
attatcgaca ctggtggata tggggcatct 63 0 0 
tgaagggctt gtgggtcaca gtctactacg 6360 
ctctattttg tgcatcagat gctaaagcat 6420 
cacatgcctg tgtacccaca gaccccaacc 6480 
aaaattttaa catgtggaaa aatgacatgg 654 0 
tatgggatca aagccttaag ccatgtgtaa 6600 
gtacaaatgc aactgttaac tacaataata 6660 
atgtaaccac agaattaaga gataagaaaa 672 0 
atatagtacc acttaataat aggaagaatg 6780 
gtaatacctc agccataaca caagcctgtc 6840 
attattgtgc tccagctggt tatgcgcctc 6900 
taggaccatg cgataatgtc agcacagtac 6960 
caactcaatt actgttaaat ggtagcctag 7020 
atctgacaaa caatgtcaaa acaataatag 7080 
gtacaagacc tggcaataat acaagaaaga 714 0 
atgcaacagg agacataata ggagatataa 7200 
aatggaatac aactttacaa agggtaagtc 7260 
cagggataaa atttgcacca cactcaggag 7320 
attgtggagg agaatttttc tattgcaata 7380 
atggtacatg cactaatggt acatgcatgt 7440 
gcagaataaa acaaattata aacatgtggc 7500 
ccattgcagg aaacataaca tgtagatcaa 7560 
gaggagataa taatactgaa acagagacat 7620 
attggagaag tgaattatat aaatacaagg 7680 
ccactgctgc aaaaaggaga gtggtggaga 7740 
tgttccttgg gttcttggga gcagcaggaa 7800 
cggtacaggc cagacaatta ttgtctggta 7 860 
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tagtgcaaca gcaaagtaat ttgctgaggg 
tcacggtctg gggcattaag cagctccagg 
aggatcaaca gctcctagga ctgtggggct 
tgctttggaa ctctagttgg agtaataaaa 
ggatgcagtg ggatagggaa attagtaatt 
actcgcaaag ccagcaggaa agaaatgaaa 
atctgtggaa ttggtttagc ataacaaatt 
tagtaggagg cttgataggt ttaagaataa 
ttaggcaggg atactcaccc ttgtcattgc 
acaggctcgg aggaatcgaa gaagaaggtg 
tagtgagcgg attcttgaca cttgcctggg 
accaccgatt gagagacttc atattaattg 
gtagtctcag gggactgcag agggggtggg 
aatattgggg tctagagtta aaaaagagtg 
cagtagctga aggaacagat aggattctag 
gcaacgtacc tagaagaata agacagggct 
aagtggtcaa aaagcagtat aattggatgg 
aggtcagcag cagagggagt aggatcagcg 
acaaccagca acacagccca caacaatgct 
gaaggagaag taggctttcc agtcagacct 
gcagcaatag atctcagctt ctttttaaaa 
tccaagaaaa ggcaagagat ccttgatttg 
gattggcaaa actacacacc gggaccaggg 
ttcaagctag agccagtcga tccaagggaa 
tgtttactac accctatgag ccagcatgga 
tggaagtttg acagtacgct agcacgcaga 
tacaaagact gctgacacag aagggacttt 
gaggtgtggt ctgggcggga caggggagtg 
tgcttttcgc ctgtactggg tctctctagg 
gctatctagg gaacccactg cttaagcctc 
tgtgtgcccg tctgttgtgt gactctggta 
tgtggaaaat ctctagca 



ctatagaggc gcaacagcat atgttgcaac 7920 
caagagtcct ggctatagag agatacctac 7980 
gctctggaaa actcatctgc accactaatg 8040 
ctcaaagtga tatttgggat aacatgacct 8100 
acacaaacac aatatacagg ttgcttgaag 8160 
aagatttact agcattggac aggtggaaca 8220 
ggctgtggta tataaaaata ttcataatga 8280 
tttttgctgt gctctctcta gtaaatagag 8340 
agacccttat cccaaacccg aggggacccg 8400 
gagagcaaga cagcagcaga tccattcgat 8460 
acgacctacg aagcctgtgc ctcttctgct 8520 
tagtgagagc agtggaactt ctgggacaca 8580 
gaacccttaa gtatttgggg agtcttgtgc 8640 
ctattaatct gcttgatact atagcaatag 8700 
aattcataca aaacctttgt agaggtatcc 8760 
tcgaagcagc tttgcaataa aatggggggc 882 0 
cctgaagtaa gagaaagaat cagacgaact 8880 
tctcaagact tagagaaaca tggggcactt 894 0 
gcttgcgcct ggctggaagc gcaagaggag 9000 
caggtacctt taagaccaat gacttataaa 9060 
gaaaaggggg gactggaagg gttaatttac 912 0 
tgggtttata acacacaagg cttcttccct 9180 
gtcagatttc cactgacctt tggatggtac 924 0 
gtagaagagg ccaatgaagg agaaaacaac 93 00 
atggaggatg aagacagaga agtattaaga 93 60 
cacatggccc gcgagctaca tccggagtat 942 0 
ccgctgggac tttccactgg ggcgttccag 9480 
gtcagccctg agatgctgca tataagcagc 954 0 
tagaccagat ctgagcccgg gagctctctg 9600 
aataaagctt gccttgagtg ccttgagtag 9660 
actagagatc cctcagacca cttgtggtag 9720 

9738 
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